THEORY AND DECISION LIBRARY 



SERIES B: MATHEMATICAL AND STATISTICAL METHODS 



DIRK HELBING 



QUANTITATIVE 

SOCIODYNAMICS 

Stochastic Methods and Models 
of Social Interaction Processes 



Springer Science+Business Media* LLC 



QUANTITATIVE SOCIODYNAMICS 




THEORY AND DECISION LIBRARY 



General Editors : W. Leinfellner ( Vienna ) and G. Eberlein ( Munich ) 



Series A: Philosophy and Methodology of the Social Sciences 
Series B: Mathematical and Statistical Methods 

Series C: Game Theory, Mathematical Programming and Operations Research 
Series D: System Theory, Knowledge Engineering and Problem Solving 



SERIES B: MATHEMATICAL AND STATISTICAL METHODS 

VOLUME 31 



Editor: H. J. Skala (Paderbom); Assistant Editor. M. Kraft (Paderbom); Editorial Board : 
J. Acz61 (Waterloo, Ont.), G. Bamberg (Augsburg), H. Drygas (Kassel), W. Eichhom 
(Karlsruhe), P. Fishbum (Murray Hill, N.J.), D. Fraser (Toronto), W. Janko (Vienna), P. de 
Jong (Vancouver), T. Kariya (Tokyo), M. Machina (La Jolla, Calif.), A. Rapoport (Toronto), 
M. Richter (Kaiserslautern), B. K. Sinha (Cattonsville, Md.), D. A. Sprott (Waterloo, Ont.), 
P. Suppes (Stanford, Calif.), H. Theil (St. Augustine, Fla.), E. Trillas (Madrid), L. A. Zadeh 
(Berkeley, Calif.). 



Scope: The series focuses on the application of methods and ideas of logic, mathematics and 
statistics to the social sciences. In particular, formal treatment of social phenomena, the 
analysis of decision making, information theory and problems of inference will be central 
themes of this part of the library. Besides theoretical results, empirical investigations and the 
testing of theoretical models of real world problems will be subjects of interest. In addition 
to emphasizing interdisciplinary communication, the series will seek to support the rapid 
dissemination of recent results. 



The titles published in this series are listed at the end of this volume. 







QUANTITATIVE 

SOCIODYNAMICS 



Stochastic Methods and Models 
of Social Interaction Processes 



by 

DIRK HELBING 

II. Institute of Theoretical Physics, Stuttgart 



Translated from German by 
Richard Calek and Dirk Helbing 




Springer-Science+Business Media, B.V. 




Library of Congress Cataloging-in-Publication Data 

Helbing, Dirk. 

Quantitative soc i odynara 1 cs : stochastic methods and models of 
social Interaction processes ! Dirk Helbing. 

p. cm. — (Theory and decision library. Series B, 
Mathematical and statistical methods ; v. 31) 

Includes Index. 

(HB : acid-free paper) 

1. Social Interact Ion— Mathemat lea 1 models. 2. Stochastic 
processes. I. Title. II. Series. 

HM291.H487 1995 

302' .01 ’51— dc20 94-35633 

ISBN 978-90-481-4482-2 ISBN 978-94-015-8516-3 (eBook) 

DOI 10.1007/978-94-015-8516-3 



Printed on acid-free paper 



All Rights Reserved 

© 1995 Springer Science+Business Media Dordrecht 
Originally published by Kluwer Academic Publishers in 1995. 
Softcover reprint of the hardcover 1st edition 1995 
No part of the material protected by this copyright notice may be reproduced or 
utilized in any form or by any means, electronic or mechanical, 
including photocopying, recording or by any information storage and 
retrieval system, without written permission from the copyright owner. 



Table of Contents 



Preface xiii 

List of Figures XV 

List of Tables xix 

Introduction and Summary 1 

Quantitative Models in the Social Sciences 2 

The Logistic Model 2 

Diffusion Models 3 

The Gravity Model 3 

The Game Theory 3 

Decision Models 5 

How to Describe Social Processes in a Mathematical Way 7 

Statistical Physics and Stochastic Methods 7 

Non-Linear Dynamics 13 

I Stochastic Methods and Non-Linear Dynamics 17 

Overview 19 

1 Master Equation in State Space 21 

1.1 Introduction 21 

1.2 Derivation 23 

1.2.1 Derivation from the Markov Property 24 

1.2.2 External Influences (Disturbances) 26 



V 




VI 



Table of Contents 



1.2.3 Internal Fluctuations 26 

1.2.4 Derivation from Quantum Mechanics 29 

1.3 Properties 36 

1.3.1 Normalization 36 

1.3.2 Non-Negativity 36 

1.3.3 The Liouville Representation 37 

1.3.4 Eigenvalues 38 

1.3.5 Convergence to the Stationary Solution 39 

1.4 Solution Methods 40 

1.4.1 Stationary Solution and Detailed Balance 40 

1.4.2 Time-Dependent Solution . 42 

1.4.3 ‘Path Integral’ Solution 43 

1.5 Mean Value and Covariance Equations 50 

2 BoLTZMANN-Like Equations 53 

2.1 Introduction 53 

2.2 Derivation 54 

2.3 Subdivision into Several Types of Subystems 57 

2.4 Properties 58 

2.4.1 Non-Negativity and Normalization 58 

2.4.2 The Gaskinetic Boltzmann Equation 58 

2.4.3 The //-Theorem for the Gaskinetic Boltzmann Equation 61 

2.4.4 Solution of the Gaskinetic Boltzmann Equation .... 63 

2.5 Comparison of Spontaneous Transitions and Direct Interactions 64 

2.5.1 Transitions Induced by Interactions 65 

2.5.2 Exponential Function und Logistic Equation 65 

2.5.3 Stationary and Oscillatory Solutions 67 




Table of Contents vii 

3 Master Equation in Configuration Space 69 

3.1 Introduction 69 

3.2 Transitions in Configuration Space 70 

3.2.1 Spontaneous Transitions 70 

3.2.2 Pair Interactions 71 

3.3 Mean Value and Covariance Equations 72 

3.4 Corrections and Higher Order Interactions 76 

3.5 Indirect Interactions and Mean Field Approaches 80 

3.6 Comparison of Direct and Indirect Interactions 81 

3.6.1 Differences Concerning the Covariance Equations .... 81 

3.6.2 Differences Concerning the Mean Value Equations ... 82 

4 The Fokker-Planck Equation 83 

4.1 Introduction 83 

4.2 Derivation 83 

4.3 Properties 87 

4.3.1 The Continuity Equation 87 

4.3.2 Normalization 87 

4.3.3 The Liouville Representation 88 

4.3.4 Non-Negativity 89 

4.3.5 Eigenvalues 89 

4.3.6 Convergence to the Stationary Solution 89 

4.4 Solution Methods 90 

4.4.1 Stationary Solution 90 

4.4.2 Path Integral Solution 91 

4.4.3 Interrelation with the SCHRODINGER Equation 92 

4.5 Mean Value and Covariance Equations 93 

4.5.1 Interpretation of the Jump Moments 94 

4.6 Boltzmann-Fokker-Planck Equations 95 

4.6.1 Self-Consistent Solution 99 




viii Table of Contents 

5 Langevin Equations and Non-Linear Dynamics 103 

5.1 Introduction 103 

5.2 Derivation 105 

5.3 Escape Time 109 

5.4 Phase Transitions, Liapunov Exponents and Critical Phenomenalll 

5.5 Routes to Chaos 113 

5.5.1 Ruelle-Takens-Newhouse Scenario and Liapunov 

Exponents 115 

5.5.2 Period Doubling Scenario and Power Spectra 116 

II Quantitative Models of Social Processes 119 

Overview 121 

6 Problems and Terminology 123 

6.1 Terms 123 

6.1.1 System and Subsystems 123 

6.1.2 State 123 

6.1.3 Subpopulation 123 

6.1.4 Socioconfiguration 126 

6.1.5 Interaction 126 

6.2 Problems with Modelling Social Processes 127 

6.2.1 Complexity 127 

6.2.2 Individuality 129 

6.2.3 Stochasticity and Disturbances 130 

6.2.4 Decisions and Freedom of Decision-Making 130 

6.2.5 Experimental Problems 133 

6.2.6 Measurement of Behaviours 133 

6.3 Summary 135 




Table of Contents 



IX 



7 Decision Theoretical Specification of the Transition Rates 137 

7.1 Introduction 137 

7.2 Derivation 138 

7.2.1 The Multinomial Logit Modell 139 

7.2.2 Entropy Maximization 141 

7.2.3 Fechner’s Law 142 

7.2.4 Utility and Distance Function 143 

7.3 Pair Interaction Rates 146 

7.3.1 Special Applications in the Social Sciences 154 

7.4 Properties of the Utility Approach 155 

7.4.1 Stationary Distribution 155 

7.4.2 Contributions to the Utility Function 157 

8 Opinion Formation Models 159 

8.1 Introduction 159 

8.2 Indirect Interactions 161 

8.2.1 A Period Doubling Route to Chaos 163 

8.2.2 A Ruelle-Takens-Newhouse Route to Chaos .... 163 

8.3 Direct Pair Interactions 164 

8.3.1 Kinds of Pair Interactions 165 

8.3.2 Oscillations 169 

8.3.3 Influence of the Interaction Frequencies 173 

8.3.4 Period Doubling Scenarios and Chaos 180 

8.4 Generalizations 195 

8.5 Spatial Spreading of Opinions 196 

8.5.1 Opinion Spreading by Diffusion 196 

8.5.2 Opinion Spreading by Telecommunication 198 




X 



Table of Contents 



9 Social Fields and Social Forces 203 

9.1 Introduction 203 

9.2 Derivation 204 

9.3 The Social Force Model 207 

9.3.1 Comparison with Lewin’s ‘Social Field Theory’ 212 

9.4 Computer Simulations 214 

9.4.1 Imitative Processes 217 

9.4.2 Avoidance Processes 223 

10 Evolutionary Game Theory 227 

10.1 Introduction 227 

10.2 Derivation of the Game Dynamical Equations 228 

10.2.1 Payoff Matrix and Expected Success 228 

10.2.2 Customary Derivation 228 

10.2.3 Fields of Application 229 

10.2.4 Derivation from the BOLTZMANN-Like Equations .... 230 

10.3 Properties of Game Dynamical Equations 232 

10.3.1 Non-Negativity and Normalization 233 

10.3.2 Formal Solution 233 

10.3.3 Increase of the Average Expected Success in Symmetrical 

Games 234 

10.3.4 Invariant of Motion for Antisymmetrical Games 235 

10.3.5 Interrelation with the Lotka-Volterra Equations . . 236 

10.3.6 Limit Cycles and Chaos 238 

10.4 Stochastic Version of the Game Dynamical Equations 239 

10.4.1 Self-Organization of Behavioural Conventions for the 

Case of Two Equivalent Competing Strategies 242 




Table of Contents 



XI 



11 Determination of the Model Parameters from Empirical Data255 

11.1 Introduction 255 

11.2 The Case of Complete Data 255 

11.3 The Case of Incomplete Data 258 

11.3.1 Parameter Estimation 262 

11.3.2 Model Reduction 268 

11.4 Migration in West Germany 269 

11.4.1 First Model Reduction 272 

11.4.2 Second Model Reduction 272 

11.4.3 Comparison of the Weidlich-Haag Model and the Ge- 
neralized Gravity Model 276 

11.4.4 Third Model Reduction 279 

11.5 Evaluation of Empirically Obtained Results 279 

11.5.1 Sensitivity Analysis 279 

11.5.2 Decomposition of the Utility Functions with Respect to 

Explanatory Variables 281 

11.5.3 Prognoses 282 

11.6 Examples for Decompositions of Utility Functions 283 

11.6.1 Purchase Pattern 283 

11.6.2 Voting Behaviour 285 

11.6.3 Gaps in the Market and Foundations of New Parties . . 289 

List of Symbols 291 

Notation and Conventions 291 

Frequently Occuring Symbols 293 

Greek Symbols 301 

Operators and Special Symbols 303 

References 307 

Index 323 




Preface 



This book presents various new and powerful methods for a quantitative and 
dynamic description of social phenomena based on models for behavioural chan- 
ges due to individual interaction processes. Because of the enormous complexity 
of social systems, models of this kind seemed to be impossible for a long time. 
However, during the recent years very general strategies have been developed 
for the modelling of complex systems that consist of many coupled subsy- 
stems. These mainly stem from statistical physics (which delineates fluctuation 
affected processes by stochastic methods), from synergetics (which describes 
phase transitions, i.e. self-organization phenomena of non-linearly interacting 
elements), and from chaos theory (which allows an understanding of the un- 
predictability and sensitivity of many non-linear systems of equations). 

Originally, these fascinating concepts have been developed in physics and ma- 
thematics. Nevertheless, they have often proved their explanatory power in va- 
rious other fields like chemistry, biology, and economics. Meanwhile they also 
receive an increasing attention in the social sciences. However, most of the re- 
lated work in the social sciences is still carried out on a qualitative level though 
in complex systems some phenomena can only be adequately understood by 
means of a mathematical description. 

Up to now, there is not very much literature available that deals with a quan- 
titative theory of social processes. Apart from the approaches of game theory, 
pioneering models have been developed by WEIDLICH and Haag concerning 
spontaneous behavioural changes due to indirect interactions [285,292,289]. 
The work presented in this book was stimulated by both approaches which first 
seemed to be incompatible. However, the main focus was put on the concept 
of direct pair interactions. This was first applied to the description of dynamic 
pedestrian behaviour [123,125] but it can be extended to the delineation of a 
variety of other social processes, e.g. of opinion formation. 

It turned out that the concepts of indirect interactions and pair interactions can 
be unified in one general model. Surprisingly, by incorporating decision theore- 
tical approaches a very fundamental model resulted which seems to open new 
perspectives in the social sciences. It includes as special cases many established 
models, e.g. the logistic equation for limited growth processes, the gravity mo- 
del for exchange processes, some diffusion models for the spreading of informa- 
tion, the game dynamical equations for cooperation and competition processes, 
and Le win’s social field theory that describes behavioural changes by dynamic 
force fields. Therefore, this model is well-founded and could be viewed as some 
kind of metatheory. In addition, it implies many new results. Consequently, 
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Preface 



in the course of time a large amount of material was accumulated concerning 
the foundation, generalization, and unification of behavioural models most of 
which is included in this book. Except for the omitted chapter on pedestrian 
behaviour, this book is essentially the translation of an expanded version of my 
PhD thesis [132] which has won two research prizes. It includes new results and 
many fruits of discussions I had in connection with presentations at a number 
of international conferences (e.g. the STATPHYS 18 in Berlin, 1992, and the 
XHIth World Congress of Sociology in Bielefeld, 1994). 

‘Quantitative Sociodynamics’ is directed to the following readership: First, to 
natural scientists, especially physicists and mathematicians who are interested 
in a comprehensive overview of stochastic methods and/or their application to 
interdisciplinary topics. Second, to social scientists who are engaged in mathe- 
matical sociology, decision theory, rational choice approaches, or general system 
theory. Moreover, this book supplies a number of relevant methods and models 
for economists, regional scientists, demographers, opinion pollsters, and market 
researchers. 

It is assumed that the reader possesses basic mathematical knowledge in calcu- 
lus and linear algebra. Readers who are not interested in mathematical details 
may continue with Part II after having read the different introductions prece- 
ding it. Moreover, those who are interested in the basic ideas only may skip 
the formulas during reading. The underlying terms, principles, or equations can 
always be looked up in Part I if necessary. An additional help is provided by 
the detailled ‘Index’ and the ‘List of Symbols’ which contains supplementary 
information about mathematical definitions and relations. Readers interested 
in an introduction to calculus, linear algebra, and complex analysis can, for 
example, consult some of the References [140,246,165,20,266,164]. A more 
detailled discussion of stochastic methods, synergetics, and chaos theory can 
be found in References [84, 151, 107, 106,254, 183, 135]. 

Here I want to thank W. Weidlich for teaching me his fascinating ideas, 
for his encouragement, his numerous useful comments, and the freedom he 
gave me concerning my work. Moreover, I am grateful to A. Diekmann, W. 
Ebeling, N. Empacher, M. Eigen, G. Haag, H. Haken, I. Prigogine, 
R. Reiner, P. Schuster, F. Schweitzer, K. Troitzsch, and many others 
for stimulating discussions and/or the inspiration by their work. I am very 
much indebted to R. Calek for his tireless engagement in connection with the 
translation of the German manuscript and to A. Dralle as well as R. Helbing 
for their many useful advices concerning the final corrections. Moreover, I would 
like to thank M. SCHANZ for his help regarding the numerical calculation of the 
Liapunov exponents in Section 8.3.4. Finally, I am pleased about the support, 
motivation, and understanding of my friends, relatives, and collegues. 
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Introduction and Summary 



The field of quantitative sociodynamics is still a rather young and very 
thrilling interdisciplinary research area which deals with the mathematical mo- 
delling of the temporal evolution of social systems. In view of the growing 
complexity of social, economic, and political developments quantitative models 
are becoming more and more important — also as an aid to decision-making. 
From a scientific point of view as well, a mathematical formulation of so- 
cial interrelations has long been overdue. Compared with purely qualitative 
considerations it allows clearer definitions of the terms used, a more concentra- 
ted reduction to the interesting interrelations, more precise and more compact 
descriptions of structures and relations, more reliable conclusions, better forec- 
asts and, thus, statements which are easier to verify [102,231]. Apart from this 
it turned out that many social phenomena cannot even approximately be under- 
stood with static concepts. Some dynamic social processes are not comprehen- 
sible as a sequence of time-dependent equilibrium structures. The description 
of self-organization and structure formation processes requires dynamic mathe- 
matical concepts which also can describe non-equilibrium phenomena, i.e. 
the temporal evolution of systems which become unstable. 

Because of the complexity of social systems a mathematical formulation of 
social processes was, for a long time, considered to be almost impossible. Never- 
theless, again and again corresponding formulations have been developed by 
social as well as natural scientists. However, the proposed concepts often were 
too simple and did not sufficiently do justice to those phenomena which they 
should describe. 

Only in the recent years, powerful methods have been developed which make it 
possible to describe complex systems consisting of many elements (subsystems) 
that interact with each other. Here pioneering were particularly the concepts of 
statistical physics for the description of chance affected processes and disco- 
veries in the field of non-linear dynamics which was shaped by synergetics 
[107, 106], chaos theory [254, 16, 183, 135], catastrophe theory [273,302], as 
well as by the theory of phase transitions and critical phenomena [145, 
177, 202,263]. These again and again proved their interdisciplinary explanatory 
power and allow the development of models for social processes at an actually 
much sounder level than it was feasible only a short time ago. In the meantime 
a remarkable list of physicists exists who already dealt with topics from the field 
of mathematical sociology: Weidlich [285,286,288-290], Haag [103], Haken 
[107,300], Ebeling [56,57,59], Malchow [179], Schweitzer [257], Prigo- 
gine [225,226], Allen [2—6], Montroll [91,185-187], Mosekilde [191,192], 
Huberman [89,90,147], Walls [284], and Lewenstein [168]. 
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During the next years, in the field of the mathematical description of social phe- 
nomena we can surely expect a rapid scientific development and fundamentally 
new results from the various disciplines involved. 



Quantitative Models in the Social Sciences 



Despite the difficulties with the mathematical modelling of social processes 
already for some time a number of formal models has been developed by so- 
cial scientists. These include Osgood and Tannenbaum’s principle of con- 
gruity [213], Heider’s balance theory [119,120], and Festinger’s disso- 
nance theory [74] which deal with the stability of attitude structures, to 
mention a few. Very interesting is also the model suggested by Lewin which 
assumes that the behaviour of an individual is determined by a ‘social field’ 
[169]. Further mathematical models were developed for learning processes 
(cf. [231], Chap. 13) and decision processes [48,53,174,209,297]. The first 
stochastic models for social processes go back to Coleman [43] and BAR- 
THOLOMEW [18]. In addition, numerous diffusion models [50,98,110,157, 
178,247] are worth mentioning. Of big importance is also the description of 
the competition and cooperation of individuals within the framework of game 
theory which was founded by von Neumann and Morgenstern [196]. Last 
but not least the event analysis or survival analysis received considerable 
attention [51,275]. That is, meanwhile a broad and substantial literature exists 
in the field of mathematical sociology (cf. also [44,187,231]). In the follo- 
wing those models which play a particular role in the course of this book will 
be discussed in more detail. 



The Logistic Model 

For growth processes an exponential temporal development was assumed for 
a long time. In many cases, however, this is not empirically confirmed. Rather it 
turned out that growth processes with limited resources can well be characte- 
rized by the logistic equation of Pearl and Verhulst [217,281]. According 
to the logistic equation growth processes start in a roughly exponential way 
and then turn into a saturation phase in which a certain system dependent 
maximum value is gradually approached (cf. Sect. 2.5.2). Exponential growth 
occurs for unlimited resources only. 

The applications of the logistic equation reach from the description of chemical 
reaction rates, the growth of animal and plant populations in the absence of 
enemies, and the growth of towns up to the description of the spreading of 
information or innovations [18,43,91, 110, 130, 187,206]. 
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Diffusion Models 

The topic 4 distribution processes’ is generally the subject of diffusion mo- 
dels. In detail these deal with the spreading of such different ‘things’ as diseases, 
rumours, conventions, norms, standards, knowledge, technologies, or goods [18, 
43,50,98,110,157,178,247]. From the point of view of spatial distribution 
processes there are similarities to diffusion processes in physics. An example 
is the dispersal of dye molecules of ink drops in a glass of water caused by the 
random motion of particles (cf. Sect. 8.5). 

However, the spatial dispersal under inhomogeneous conditions such as the dis- 
semination of information within the irregularly distributed population of a 
country is more complicated. This is particularly true if the information is not 
distributed by central media but via personal communication or telecommuni- 
cation (cf. Sect. 8.5.2). 



The Gravity Model 

The gravity model can be applied to the description of exchange processes 
such as the exchange of goods, the traffic volume, or the removals (‘migra- 
tion’) between different towns or regions (cf. e.g. Refs. [235,305]). Although 
the background is completely different, the name of the gravity model stems 
from the mathematical similarity with the law of gravity which applies to the 
attraction between celestial bodies as earth and moon. 

The gravity model bases on the observation that the frequency of the considered 
exchange (e.g. migration) is proportional to the size of the ‘source’ (to the 
number of inhabitants there) and to the size of the ‘sink’ (to the number of 
inhabitants of the destination town since this is a measure for the cultural 
infrastructure, employment facilities, etc.). The exchange frequency decreases 
proportionally to the reciprocal value of the distance between source and sink. 
This is, on the one hand, because the transaction costs (e.g. removal costs) 
increase with distance but, on the other hand, because the number of alternative 
towns grows with the distance (cf. Sect. 7.3.1). 



The Game Theory 

Whereas the up to now introduced quantitative models describe relatively sim- 
ple phenomena, game theory is a convenient tool to comprehend rather com- 
plicated phenomena. Above all, this applies to the modelling of social compe- 
tition and cooperation on the basis of a success or utility oriented behaviour 
of individuals. Important applications stem from economics (e.g. in the oli- 
gopoly theory [269,280]) as well as from the social and behavioural sciences 
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(e.g. concerning the ‘prisoner’s dilemma’) [12, 13, 175, 196,234,252,253,283] 
but also from biology (with respect to evolution theory or ecology) [60,61, 
78,139,256]. 

It turned out that in order to understand particularly the development of co- 
operation among egoists, the repeated interaction (and thus the time dimension) 
is of great relevance (‘shadow of the future’) [13]. To consider this, a mul- 
tiple repetition of the single game and decision moves was applied for a long 
time (‘iterated games’). Nowadays the game dynamical equations find 
more and more resonance [137,139,256,271,303] (cf. Chap. 10). These are dif- 
ferential equations and thus continuous in time which at least is more realistic 
if there are no single, clearly defined moves at discrete times. 

The game dynamical equations originate from the theory of evolution [137, 
271,303] where they celebrate great successes. Therefore, in connection with 
the game dynamical equations, we also speak of evolutionary game theory. 
However, in contrast to the theory of evolution where the equations can be 
derived from the laws of genetics (cf. Ref. [138], pp. 43ff), no ‘microscopic’ 
explanation of the game dynamical equations could be presented for a long 
time. Such an explanation of the game dynamical equations on the basis of 
individual behavioural principles will be introduced in Section 10.2.4 [128]. 

Furthermore, in Section 10.4.1 a game dynamical model for the spontaneous 
self-organization of behavioural conventions in the case of competing 
equivalent behavioural strategies will be developed [128,131]. A well-known 
example for this is the rivalry of the originally equivalent video systems VHS 
and BETA MAX [9,116] which was very clearly decided by the purchasers’ 
behaviour. Here three questions arise: Why at all can one of both equivalent 
alternatives win out over the other? How long does it take to gain a certain 
market share? Why does a certain product win out over its competitors in 
certain branches of industry whereas in other branches several products coexist? 

The solution of these questions was found in connection with the description of 
a phenomenon which is known from the behaviour of pedestrians [123, 124]. 
As one can easily observe, in dense pedestrian crowds lanes of uniform wal- 
king directions are formed. With respect to the walking direction, these lanes 
move (in Germany and some other countries) almost always on the right-hand 
side though the left-hand side is completely equivalent [204,205]. The reason 
for this is that an avoidance manoeuvre is sucessful only if two each other 
encountering pedestrians both swerve either to the right or to the left. Other- 
wise they must slow down or stop in order to avoid a collision. Therefore, it 
is advantageous for a pedestrian to choose the strategy which is shared by the 
majority. As soon as accidentally a small majority favouring a certain strategy 
arises it will grow on and on. Consequently, a behavioural convention origina- 
tes which, in the end, is caused by social self-organization. Once established 
conventions are often fixed by laws or norms after some time. 
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Nevertheless, there are also mechanisms which hinder the triumph of one be- 
havioural alternative. One example for these are spontaneous strategy changes 
which result from the occasional trying out of behavioural alternatives (trial 
and error). These can be taken into account mathematically as well. It is 
expected that spontaneous strategy changes somewhat reduce the attainable 
majority of the developing behavioural convention. For a small frequency of 
spontaneous strategy changes this is actually true. However, if this frequency 
exceeds a critical value the formation of a behavioural convention will be sup- 
pressed and each of the two strategies will be chosen with equal probability. 
Whether one of many equivalent behaviours can win out at all depends on 
whether the advantage which results from an adaption to the behaviour of the 
majority exceeds the influence of spontaneous strategy changes. 

In the case of the avoidance problem spontaneous strategy changes obviously 
play a subordinate role. Otherwise the preference for the right-hand side (in 
Germany) would not have developed. However, with the same probability a 
preference for the left-hand side could have emerged. The existing conventions 
can vary from one region to another. This applies, for example, to the driving 
direction of road traffic. Whereas in Great Britain left-hand traffic is prescri- 
bed, in the rest of Europe right-hand traffic is usual. Thus an explanation of 
the symmetry breaking regarding avoidance behaviour on the basis of the 
asymmetry of the human body is to be excluded. 

The described model for the development of conventions is also applicable to 
the revolution direction of clock hands, the direction of writing, and many other 
examples where it is advantageous to choose the same behavioural alternative 
as the majority. In the case of the rivalry of the above mentioned equivalent 
video systems it is the compatibility of the recorders (concerning copying, selling 
or hiring of tapes) which makes it profitable to join a once accidentally arisen 
majority. 

The above model for the self-organization of behavioural conventions can easily 
be extended to the case of more than two equivalent behavioural alternatives. 
Furthermore, situations can be described where one behavioural alternative is 
superior to the others. However, here the formation of a behavioural convention 
is trivial. 



Decision Models 

It is well-known that decisions are processes the outcome of which normally is 
not predictable. At best, the probability for a particular outcome can be esti- 
mated. A very detailed model for decision processes stems from Feger [69] 
(cf. Sect. 6.2.4). According to this, decision situations are conflict situations in 
which one from several behavioural alternatives must be chosen. The decision 




6 



Introduction and Summary 



process mainly consists of a collection of arguments which are in favour of one 
or another alternative (‘anticipation of consequences’). Here the order of 
arguments is quite accidental. The frequency of arguments supporting a cer- 
tain alternative increases with the preference for it. If a certain number of 
arguments one after the other supports one and the same behavioural alterna- 
tive the decision is made. Therefore, the probability of a decision for a certain 
alternative depends on the probability of arguments supporting it and thus on 
the preference for it. 

Nevertheless, in many cases it is not the decision process which is of interest 
but only the outcome of the decision. The functional dependence of the de- 
cision probability for a particular behavioural alternative on the preference is 
described by the multinomial logit model [53]. In this connection, a mea- 
sure for the preference is the utility which is expected as a consequence of a 
certain decision. The multinomial logit model proved itself in various areas. Its 
applications reach from purchase patterns up to the choice of the location of 
residence or of the means of transport [48,209,297]. 

The fact that a decision usually does not have a unique outcome is related to 
the incompleteness of information about the actual utility. Naturally, the 
estimate for the actual utility normally varies among individuals and is possibly 
also dependent on the mood. The more uncertain the knowledge about the 
actual utility of the possible behavioural alternatives is the more probabilistic 
the outcome of the decision. 

If one tries to develop a dynamic behavioural or decision model, the problem 
will arise that a temporal sequence of decisions is to be described. In this case, 
the single decisions of the mentioned decision sequence will again be treated 
with the multinomial logit model. However, the probability of a decision now 
depends on the previously made decision, i.e. on the current behaviour. This 
probability defines the transition probability from one behaviour to another. 
It will be greater than zero if the corresponding behavioural change is connected 
with an increase of utility (cf. Sect. 7.2.1). 



Final Remark 

Up to now, the logistic equation, the gravity model, the diffusion models, the 
game dynamical equations, and the decision models were viewed as single theo- 
retical concepts standing next to each other without any relations. In this book, 
for the first time, a general, mathematical, and dynamic model will be develo- 
ped which includes all these concepts as special cases. This behavioural model 
represents a kind of a metatheory which is — at least in the above mentioned 
cases — empirically well-founded. 
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How to Describe Social Processes in a Mathe- 
matical Way 

Most problems accompanying the attempts to mathematically describe social 
systems arise from their complexity. This causes an extreme sensitivity to 
all sorts of influences so that, in similar situations, social systems sometimes 
behave in a completely different way. An additional difficulty is the chance 
affectedness (randomness, stochasticity ) of individual decisions which also 
influences the concrete behaviour of a social system (cf. Chap. 6). 

For a long time this made the mathematical description of social systems ap- 
pear generally impossible. During the last years, however, current insights in 
complex systems have shed some light on these problems. Above all, they 
come from the fields of non-linear dynamics (synergetics, chaos theory) 
and statistical physics (where numerous stochastic methods were developed 
for randomly behaving systems). 

In Part I of this book, the most important concepts of these fields, their inter- 
relations, and properties will be discussed in a unified and self-consistent form. 
On this occasion, entirely new methods are also considered. One of these is 
the ‘path integral’ solution of the master equation which is applicable to 
problems such as process control and event analysis [51,275]. 

Non-linear dynamics and statistical physics have often offered successful ex- 
planations even for non-physical phenomena, e.g. in chemistry [208,85,181, 
200], in biology [8,92,91] as well as in the economic and social sciences 
[18,42,43,285,286,292,294,289,291]. For the investigation of a given complex 
system, the way of proceeding usually is to find appropriate stochastic equa- 
tions which are afterwards further examined with the methods of non-linear 
dynamics. It is to be emphasized that what is done here, in contrast to the out- 
dated ‘physicalistic’ approaches, is no model transfer but a methodological 
transfer. A simple model transfer by merely reinterpreting the model variables 
were questionable in so far as it would assume the compared systems to possess 
the same properties (characteristics) — at least in those aspects that are funda- 
mental for the model. However, this ‘isomorphism’ often is only insufficiently 
guaranteed. In contrast to this, a methodological transfer considers the system 
specific properties explicitly by the respective model assumptions. 



Statistical Physics and Stochastic Methods 

For some decades, stochastic equations have proven a versatile and useful 
instrument in all fields of science which deal with fluctuations (random va- 
riations) [117,188,189,151]. Some phenomena become clearly comprehensible 
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only with stochastic models. Above all, this applies to phase transitions and 
critical phenomena [145,177,202,263]. Especially, the phase transitions are 
of particular interest because they mean a fundamental change of the system 
properties. An example for this is the transition from ice to water or from water 
to steam. 

Statistical physics deals with systems consisting of a huge number of elements 
(subsystems) which possibly can be subdivided into several types of elements 
(e.g. different sorts of gas molecules). The elements of every type (i.e. equivalent 
subsystems) must be present in a huge number. Furthermore, let us assume that 
each element can be characterized by a number of time- dependent variables 
(quantities) which together describe its state and can be combined in a state 
vector. The elements have interactions with each other, i.e. they can change 
their states mutually, e.g. by exchanging energy or information. Apart from 
interactions there may occur also spontaneous state changes (which result 
from the properties of the elements or are induced by external influences on 
the system). 

Even if the exact laws for the behaviour of the elements of such a system 
and their interactions are known, due to the huge number of elements it is 
impossible to exactly calculate their behaviour. In most cases, however, the 
point of interest is not the behaviour of the single elements (i.e. their temporal 
state changes) but rather the distribution of states within the system. 

It turned out that for the description of the temporal evolution of the state 
distribution probabilistic models (stochastic equations) are suitable [84, 
126]. Above all, this applies to the master equation [215] (cf. Chap. 1) but 
also to the Fokker-Planck equation [79, 221] which can be understood as a 
Taylor approximation of the master equation (cf. Chap. 4). A similar thing is 
true for the stochastic differential equations (L angevin equations [166]) 
which, in a certain respect, represent a reformulation of the Fokker-Planck 
equations [267] (cf. Chap. 5). The Langevin equations, however, characterize 
the behaviour of single elements the state changes of which, on the one hand, go 
back to a systematic effect which describes the most probable state change. 
On the other hand, they stem from fluctuations which reflect individual varia- 
tions of the state changes. In contrast to this, the Fokker-Planck equation 
as well as the master equation describe the temporal evolution of the state 
distribution of many subsystems (i.e. of an ‘ensemble’ of elements). 



The master equation 

In order to be able to describe the time-dependent change of the state distri- 
bution, it is not necessary to know in detail which elements experience which 
state changes (transitions). However, we must know how many elements per 
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time unit go over to the other possible states. Thus, it will not matter if a 
•particular element is subject to a certain state change or another one , instead. 

The temporal evolution of the state distribution is given by several quantities. 
One of these is the present state distribution itself because it characterizes how 
many elements can potentially go over from a particular state to another. The 
rate of such state changes is described by so-called transition rates. These 
are, on the one hand, given by the frequency (per time unit) with which 
spontaneous or interaction induced transitions take place at all. On the other 
hand, they are determined by the (transition) probabilities with which chan- 
ges from the respective occupied state into the other available states occur. In 
order to describe a system of the considered kind mathematically, the transition 
rates must be determined in a system specific way. The resulting equation for 
the temporal evolution of the state distribution is a master equation which 
describes the concrete processes within the system. 

Up to now, we treated the master equation in state space which consists of 
state vectors. An analogous master equation can also be formulated for con- 
figuration space consisting of vectors that combine the occupation num- 
bers (frequencies) of all states in the configuration of a system. The master 
equation in state space then corresponds to the approximate mean value 
equation of the related master equation in configuration space. 

The configurational master equation has some essential merits [133,289]: First, 
it allows to use individual transition rates which depend on the occupation 
numbers of the states. Such cases are important for the description of indi- 
rect interactions which remarkably often occur in social systems. Second, the 
configurational master equation makes possible to gain equations which take 
into account pair interactions or even higher order interactions. Third, 
from the configurational master equation covariance equations can be de- 
rived which are essential for the determination of the reliability of the mean 
value equations. Finally, it allows to calculate corrections for the approximate 
mean value and covariance equations (cf. Chap. 3 und Sect. 10.4) 



Application to the Description of Social Systems 

Above all, it were Weidlich and Haag from the University of Stuttgart in 
Germany who realized that the master equation is particularly suitable for 
the description of social systems [289,285,286,292,294]. The elements (sub- 
systems) then represent individuals (human beings, persons) and the states 
stand for certain aspects of their behaviour (e.g. for the expression of opinions 
concerning a special affair). In these models, the distinguished behaviours have 
only to be classified but not to be linearly ordered , and they need not to form a 
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continuous behavioural space. Thus, problems which are related to the measu- 
rement or scaling of social quantities do not arise. Nevertheless, by analyzing 
empirical data, the degree of dissimilarity of the different behaviours (their ‘di- 
stance’) can be determined by a method introduced in Chap. 11. From this, 
by means of multidimensional scaling [161,301], a behavioural space can 
be constructed afterwards. 

Above all, the problem is to find a suitable system specific approach for the 
transition rates. However, for this problem a solution was already formulated in 
Section ‘Decision Models’. There it turned out that the transition probability 
with which an individual decides in favour of a new behaviour is determined by 
the utility of the behavioural change and can be descibed by a multinomial 
logit model (cf. Chap. 7 and [289], §3). 

Since this formulation can be considered as sufficiently founded, it is not requi- 
red to laboriously determine the utility of each behaviour by means of extensive 
surveys and suitable calibrations. It is rather simple to calculate the utilities 
from data material describing the temporal course of the behavioural distribu- 
tion (e.g. opinion distribution) and the transition rates [294] (cf. Chap. 11). 

With the empirically determined utilities, for instance, the analysis and pro- 
gnosis of voting or purchasing behaviour is possible. For economy or politics 
it is, furthermore, of particular interest which factors contribute with which 
importance to the utility of certain purchasing or voting decisions. This can 
be determined from suitable data material via a regression method in which 
it is assumed that the utility, on the one hand, depends on the interests of 
purchasers and voters respectively and, on the other hand, on the various cha- 
racteristics of the offered products or political concepts (cf. Sect. 11.6 and Refs. 
[238,294]). 

The circumstance that the utility of a behaviour (and so the probability of the 
corresponding behavioural change) often varies among different individuals can 
also be taken into account. For this purpose, the individuals of the considered 
social system are divided into several (behavioural) types (‘characters’). In- 
dividuals of the same behavioural type should tend to the possible behavioural 
changes with similar probabilities. 

In order not to have too many behavioural types and to obtain large sub- 
populations (defined by the individuals of the same type) the investigation 
is restricted to simple behavioural processes and simple situations (cf. 
Sect. 6.2.2). Then it does not matter how the single individuals behave in other 
situations and areas of life. It is even admissible that the modelled behaviours 
are influenced by other areas of behaviour as long as these influences have no 
systematic effect (with respect to the average of the individuals of a subpopula- 
tion). In the case of voting behaviour, for instance, it could suffice to determine 
the subpopulations by the level of income, education, and confession. 
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Weidlich and Haag have developed master equation models for the descrip- 
tion of a multitude of different social phenomena. Among them there are models 
of opinion formation, migration models, and models for the emergence of settle- 
ment structures, but also non-equilibrium models for economic decisions [289, 
290,292,291]. In the case of the migration models, a comparison with empiri- 
cal data could be carried out. This provided very good results for the model 
assumptions concerning the transition rates (i.e. the frequency of behavioural 
changes) [294]. 



BoLTZMANN-Like Equations 

The models of Weidlich and Haag base on indirect interactions of indi- 
viduals, i.e. the frequency of individual behavioural changes depends on the 
so cio configuration or the behavioural distribution within the social system 
[289,292]. Therefore, the behaviour of an individual has, in turn, an effect on 
himself/herself (feedback effect). 

Indirect interactions are especially suitable for the description of behavioural 
changes which are indirectly induced by the media (radio, TV, newspapers), 
caused by generally accessible information, or evoked by the ‘so cio- cultural and 
political climate’. In physics they play a role in mean field theories [24] which 
are, for instance, used for the description of the magnetization of iron (Ising 
model: cf. Refs. [10,148,304]). 

Many behavioural changes, however, go back to direct interactions. This, 
for example, concerns behavioural changes which are triggered by the beha- 
viour of other individuals. A typical case for this are opinion changes caused 
by discussions. The most important effects of direct interactions can already 
be understood by considering pair interactions which normally yield the lar- 
gest contribution. In principle, however, simultaneous interactions of higher 
order (interactions between several individuals) can be also treated mathema- 
tically (cf. Sect. 3.4). 

In order to describe pair interactions, in contrast to Weidlich and Haag’s 
master equation models, additional terms are required which take into ac- 
count that the rate of the corresponding behavioural changes is proportional 
to the number of potential interaction partners. The resulting equations are 
BoLTZMANN-like equations (cf. Chap. 2). The fundamental significance of 
these becomes particularly clear in the following quotation by Uhlenbeck 
[276]: 



“The Boltzmann equation has become such a generally accepted 
and central part of statistical mechanics, that it almost seems 
blasphemy to question its validity and to seek out its limitations.” 
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The BoLTZMANN-like equations were originally developed for the description 
of the behaviour of gases where the pair interactions are caused by collisions 
(scattering processes) of gas particles. The transferability to avoidance and 
deceleration manoeuvres at encounters of pedestrians is obvious (cf. Refs. [123, 
125]). 

Normally, however, the pair interactions in social systems are of a totally dif- 
ferent nature. In the case of opinion formation, for instance, pair interactions 
are to be imagined in such a way that two individuals with certain opinions 
occasionally communicate with each other and possibly accept other opinions 
as a consequence of their exchange of ideas. So, instead of velocities this time 
opinions are changed. Moreover, in the case of opinion formation (but also 
of behavioural changes in general) there are other kinds of pair interactions 
(cf. Sect. 7.3 and 8.3). One can distinguish between persuasion processes 
in which the opinion of another person is taken over, avoidance processes 
in which a particular opinion is evaded in order to differ from the interaction 
partner, and compromising processes. With the corresponding model, e.g. 
fashion cycles can be well understood [127]. 

Direct and indirect interactions do not exclude each other mutually. They rat- 
her complete each other and can be integrated in a general model. The limi- 
tation to one of both interaction mechanisms could lead to the situation that 
the empirical data can be reproduced only insufficiently. In physics, for exam- 
ple, the properties of nuclei can be explained only satisfactorily by combining 
a mean field with direct residual interactions (‘pairing effect’) (cf. Ref. [278, 
298]). 

Social Fields and Social Forces 

For the case of a continuous or quasi- continuous behavioural space, the 
BoLTZMANN-like equations can be transformed into Boltzmann-Fokker- 
Planck equations by a Taylor approximation (cf. Sect. 4.6 and Ref. [130]). 
These equations have a very intuitive interpretation. Accordingly, there exists a 
vector-like quantity which describes the most probable behavioural change 
of individuals of a certain behavioural type. This quantity can be interpre- 
ted as a social force because it is the drive behind behavioural changes 
and, furthermore, it is influenced by the behaviours of other individuals. It 
could also be said that the social force describes concrete motivations to act 
which are evoked in an individual as a reaction to his/her environment. In the 
Boltzmann-Fokker-Planck equations, apart from social forces, there also 
appear so-called diffusion coefficients. These reflect the individual variations 
of behaviour (cf. Chap. 9, [130]). 

In certain cases the social force can be expressed by a social field. This can be 
imagined as a dynamically changing mountain range in behavioural space where 
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the steepness is a measure for the social force which is at work at a certain place 
of the behavioural space and, accordingly, affects the corresponding behaviour. 
The idea of a social field which describes the influence of public opinion, norms, 
trends, and environment on individual behaviour originally goes back to Lewin 
[169]. It is also related to the mathematical formulation of indirect interactions 
suggested by Weidlich and Haag [292]. 

In some contexts, the description of behavioural changes by social forces proves 
its extraordinary worth. For example, the distance which a pedestrian keeps to 
others is given by an equilibrium of two opposite forces — one force which des- 
cribes the attempt to move with a certain desired speed and another, repulsive 
force reflecting the readiness to respect the territory which another pedestrian 
claims for himself/ herself. If a pedestrian feels a growing pressure of time due 
to delays, the force describing the tendency to get ahead increases compared 
to the repulsive force so that he/she keeps less distance or even pushes. This 
effect can, interestingly enough, also be observed in queues that have come to 
rest though a reduction of distance does not lead to getting on faster [123, 124, 
129]. 

The force model is also a very promising starting point for a quantitative de- 
scription of group dynamics (cf. also p. 16). Here, for instance, attractive 
forces due to certain likings or common interests play a role but also repulsive 
forces due to conflicting goals. 



Non-Linear Dynamics 



The interactions between the single elements of a system are in general non- 
linear, i.e. the effects are not simply proportional to the causes. Thus the 
behaviour of the system as a whole cannot simply be understood as a sum 
or superposition of the behaviours of the constituent elements. Especially, an 
element often has an indirect effect on itself (feedback effect). 

The non-linearity of the interactions has some important consequences. On 
the one hand, it turns out that a few variables often determine the values of 
the other variables (‘slaving priciple’) [106, 107] so that they are sufficient to 
describe the behaviour of the system on their own. On the other hand, the non- 
linearity often causes instabilities which lead to complex dynamics or self- 
organization phenomena, i.e. to the emergence of new system properties. 
(A more detailed and very recommended discussion of this topic is given in 
[239].) 
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The 6 Slaving Principle’ 

The dynamic processes within a considered system can be characterized by 
the complete set of variables which describe the time- dependent states of its 
subsystems. However, it is often more suitable to choose variables which re- 
flect processes that are going on in the system at different ‘speeds’. For the 
determination of a suitable set of variables mathematical methods have been 
developed (cf. Chap. 7 and 8 of Ref. [106]). 

These variables can be subdivided into three classes (‘separation of time 
scales’): First, variables which change approximately at a rate (on a ‘time 
scale’) which corresponds to the rate of changes of the examined processes in 
the system. These are denoted as order parameters. Second, variables which 
change much more slowly. They can be considered as quasi- constant and 
treated as fixed system parameters. Third, variables which change much faster 
than the examined processes and can be understood as fluctuations. Most 
of the time, the fast variables remain near their equilibrium values which can 
gradually move along with the temporal change of the order parameters. 

It can be mathematically shown that the fast variables are already determined 
by the order parameters due to the non-linear coupling (‘slaving principle’). 
Therefore, they can be ‘ quasi- adiabatically’ eliminated from the examined 
equations. As a consequence, from the numerous system variables only a few 
order parameters are required for the description of the substantial processes 
which means an enormous data reduction and information compression. 
(Cf. Refs. [108,106,107].) 

The order parameters reflect the dynamics of the phenomena which take place 
on a ‘macroscopic’ or collective level in a self-consistent way. In general, they are 
described by a closed system of differential equations. Due to the information 
compression the dynamics of the order parameters can agree for totally different 
systems so that in this case one sometimes speaks of universal laws. These 
universal laws are the cause for many analogies between the macroscopic 
behaviour of different systems the elements of which do not show any similarity. 
(Cf. Refs. [106-108,302].) 

The method of the separation of time scales will be discussed in more detail 
in Sections 1.2.3 and 8.3.4. Moreover, it is important in another context be- 
cause it means that for describing social processes many conceivable influences 
play no essential role. So, for the modelling of behavioural changes, it will not 
be essential to comprehend in detail each of the different thought proces- 
ses (trains of thought) if the thought processes are clearly faster than the 
behavioural changes resulting from these. On this condition, the behavioural 
changes can be understood as reactions occuring with certain probabilities to 
the perceptions that take place (cf. Sect. 6.2.1). 
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It should be emphasized that variables which change on a comparable time 
scale as the examined processes need not necessarily to have an influence on 
the latter since they possibly are not dynamically coupled with those at all. 
So there are several behavioural areas which hardly mutually influence each 
other and thus can be modelled separately. 



Instabilities and Complex Dynamics 

Non-linear equations can exhibit a much more complex behaviour than linear 
ones. So, instead of a unique solution, several stationary (time-independent) so- 
lutions often exist. Depending on its initial state, the system approaches one of 
them. A change of the (control) parameters which characterize the external 
conditions of the system, however, can lead to an instability of the stationary 
solution. As soon as a certain critical point is exceeded, unexpectedly we will 
have an entirely different system behaviour than before, e.g. another statio- 
nary state, stable oscillations (e.g. limit cycles), or even chaos [106] (cf. 
Sect. 8.3). The sudden change of the system behaviour is indicated by critical 
fluctuations (large state oscillations) and a critical slowing down (a very 
slow approach of the stable solution). 

The term ‘chaos’ denotes a very complex dynamic behaviour which is extremely 
sensitive to the initial state. If the latter is varied only inconceivably little, 
the system may evolve completely different in the course of time (‘butterfly 
effect’). An example for chaos regarding opinion formation will be introduced 
in Section 8.3.4 [131]. Chaotic behaviour is probably the cause of the previously 
mentioned sensitivity and unpredictability of complex social processes. 



Self- Organization and Emergence 

When passing through critical points (points of instability) by changing con- 
trol parameters, the system behaviour changes drastically. In this context, one 
often speaks of phase transitions, self- organization processes, or the 
emergence of new system properties. The term ‘self-organization’ is justi- 
fied since the changed system behaviour is not determined in detail by the 
environmental influences (e.g. the boundary conditions) but bases on the 
autonomous dynamics of the system. (Cf. Refs. [107, 106]). 

In Section 9.4, for example, it can be found that the opinion distribution in a 
subpopulation will develop completely differently in time if the control para- 
meter ‘indifference with respect to the held position’ exceeds a certain critical 
value. Whereas below this value one sticks to the personally preferred position, 
above the critical value it will be abandoned in favour of new positions. 
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A typical phase transition in physics is the transition from gaseous to liquid 
state when the control parameter ‘temperature’ is reduced below the condensa- 
tion point. Partially related with the formation of liquid drops is the grouping 
of individuals. Whereas, due to the respective individual behaviour, the in- 
ternal structure of a group is subject to fluctuations, to the outside a group 
behaves as a unity. So it can be described by its own, ‘macroscopic 5 (collec- 
tive) variables which change at a considerably slower rate than the individual 
behaviours. 

If the interest is focused on the interactions of groups instead of on the inter- 
actions among individuals, one can find a similar phenomenon — the formation 
of organizations which change even slower than groups do. Finally, the organi- 
zations found a society. 

Obviously, in this case the non-linear dynamics gives rise to a formation of 
different levels which are hierarchically ordered one below the other. A si- 
milar situation can be found in physics with regard to elementary particles. 
These form nuclei which, with electrons, combine to atoms. And the atoms 
form chemical molecules. These organize themselves, for instance, as solids 
which altogether yield celestial bodies. In biology comparable hierarchies can 
be observed. 

On each lowest level we find the strongest interactions. This is obviously the 
reason for the circumstance that here the state changes proceed in the fastest 
way. If these interactions are attractive bonds will arise. These causes the ele- 
ments to behave no longer completely individually but to form units (systems) 
which represent the elements (subsystems) of the next level. Since the at- 
tractive interactions are more or less ‘saturated’ by the bonds, the interactions 
within these units are stronger than the interactions between them (boundary 
formation). The relatively weak residual interactions between the formed 
units induce their relatively slow state changes. Consequently, a general inter- 
dependence between the interaction strength, the rate of state changes, and 
the formation of hierarchical levels can be found. 

In this section the connections with general system theory [22,33,42,176, 
214,232] came to light especially clearly. With the quantitative approaches dis- 
cussed above we can expect that a mathematical formulation of system theo- 
retical concepts is possible even on a much larger scale. 




Part I 



Stochastic Methods and 
Non-Linear Dynamics 




Overview 



Part I of this book deals with stochastic methods and the basic principles 
of non-linear dynamics, which will be taken up again in Part II. In this 
way the mathematical formalism can be developed in its full generality and 
the interrelations between the various treated equations can be particularly 
elaborated. Further, in the second part which focuses on applications the scope 
of formal aspects can be lessened so that the results will stand out more clearly. 

In detail, the way of proceeding in Part I is as follows: First, in Chapter 1 
the master equations for state space will be developed. In particular, the 
connection to microdynamics (to quantum mechanics) will be shown. After- 
wards, a short overview of the most important properties of the master equation 
will be given. Especially, a new solution method will be introduced, namely the 
‘path integral’ solution. Since this allows the calculation of path dependent 
quantities, it is suitable for process control and event analysis. 

In Chapter 2, by factorization of the pair distribution functions, Boltzmann- 
like equations will be obtained. These consider spontaneous transitions 
and pair interactions of subsystems but neglect interactions of higher order. 
Summarizing systems of equal type the equations simplify enormously 
because the number of variables is drastically reduced. 

A master equation for configuration space (the space of occupation num- 
bers) is derived in Chapter 3. This allows the treatment of occupation number 
dependent transition rates which play an essential role in social processes. The 
master equation in state space and the BoLTZMANN-like equations result from 
it as approximate mean value equations. However, the configurational 
master equation allows also the derivation of covariance equations and cor- 
rected mean value equations as well as the consideration of higher order 
interactions. 

For various reasons, the derivation and solution of the covariance equations 
should not be omitted: First, these are necessary to determine the time in- 
terval for which the approximate or corrected mean value equations are valid. 
Second, they are required to estimate the reliability of prognoses. Third, 
they distinguish various stochastic processes which possess the same mean value 
equations. 

By means of a Taylor expansion of the master equation a Fokker-Planck 
equation will be obtained in Chapter 4. One of its advantages is that it is of- 
ten easier to solve than the master equation and depends only on the first two 
jump moments (the drift and diffusion coefficients). Finally, for systems which 
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consist of many subsystems the ‘Boltzmann-Fokker-Planck equati- 
ons’ result by factorization of the pair distribution functions in the Fokker- 
Planck equation or by Taylor expansion of the BoLTZMANN-like equations. 
Since they depend non-linearly on the distribution functions, a suitable strat- 
egy for their solution seems to be a self-consistent method similar to that 
of Hartree. 

In Chapter 5 stochastic differential equations will be introduced. The 
emphasis will be on the Langevin equation which, in a certain sense, is 
equivalent to the Fokker-Planck equation. However, whereas the Fokker- 
Planck equation describes the probability distribution of states, the Lan- 
gevin equation reflects the individual behaviour of single subsystems which 
are subject to the influence of fluctuations. Part I closes with a discussion of 
phase transitions which are characterized by a drastic change of the system 
behaviour. In this connection, the various routes to chaos will be elucidated. 




1 Master Equation in State 
Space 



1.1 Introduction 



Many systems do not change with time in a deterministic (unique) way. Hence 
for the change of a system from a certain state to one out of many possible 
other states often it is only transition probabilities that are given. 1 The 
reason for the uncertainty with respect to the state change depends on the 
special situation: 



• It can result from external influences (disturbances). 

• Also internal fluctuations are possible, e.g. in a system that consists 
of many subsystems (elements). 

• In quantum mechanics the transitions are inherently non- deterministic 
due to Heisenberg’s uncertainty principle [121,122]. However, the 
corresponding probabilities of their occurence can be evaluated. 

• Chaotic dynamics also brings about an uncertainty due to the so-called 

butterfly effect. 



If there is an uncertainty about the concrete change of a system’s state, only 
a stochastic description is possible. That is, we can calculate only the pro- 
bability P(X, t) to find a state X at a future time t. The temporal change of 
the probability distribution is given by 

-j- P(X , t ) = flow into X - flow out of X (1.1) 

at 

where the flow is a measure for the number of transitions per time unit. Due to 
the distinction of inflow and outflow terms the construction and interpretation 
of master equations is particularly simple. 

The ‘flow into X y is the sum of all flows into state X out of the various 
states X ' . Here the ‘flow out of state X' into state X y is proportional to 



J The terms state, system, etc. will be specified in Table I on page 136. 
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the probability P(X f ,t) of state X f to occur and to the transition rate 
w(X\X';t) from X 7 to X, i.e. of the form 

flow out of X' into X = w(X\X']t)P(X',t) . (1.2) 

The transition rate w(X\X f ; t ) denotes the transition probability P(X, t + 
At\X\t) > 0 from state X 7 to state X per time unit At: 

w(X\X';t) := > 0 (X ± X') . (1.3) 

In summary we can write 

flow into X — 53 w(X\X';t)P(X',t). (1.4a) 

X'(5 IX) 

Similarly, the ‘flow out of X’ is just the sum of the flows out of state X into 
other states X 7 . In total analogy we get 

flow out of X = 53 w(X'\X’,i)P(X,t). (1.4b) 

X'&X) 

Inserting (1.4) into (1.1) the following stochastic equation for the temporal 
change of the probability distribution P(Xjt) is obtained: 



jP(X,t)= 53 [wiXlX'-^PiX'^-wiX'lX^PiX^)] 

X'&X) 



(1.5a) 



This equation is called the master equation [215]. The summation includes 
all states X 1 of the state space T. T is the set of all possible states of the 
considered system. In case of T being not discrete but continuous (and of the 
dimension dim T = M) the sum over X' must be replaced by an integral: 



j t P(X,t) = J d M X' 



w(X |X'; t)P(X f , t ) - w(X f \X;t)P(X, t ) 



(1.5b) 



P(X,t) has the meaning of a probability density, then. 

In order to arrive at the master equation (1.5), it must be implicitly presup- 
posed that the considered system can be described by a Markov process 
[159,244]. In simple words, the assumed Markov property means that the 
transition rates from a state X to another state X 7 depend on the previous 
state of the system only. To take into account ‘memory effects’ (in cases 
where the transition rates also depend on the former states of the system) the 
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generalized master equation [156,241] 



= j dt'Y, \wt-t'{X\X'-t')P{X' ,t') - w t - t ,(X'\ X;(')P(X,<')] 

to X ' 



( 1 . 6 ) 



is required. 

Master equation (1.5) would describe a deterministic process if 



1. the transition rates were deterministic which means 

iv(X’\X;t) = 0 for all X exept for one single state X 1 = F(X , t) E T 

(1.7) 

and implies 

w(X'\X; t) = w{X'\X- t)6 x , F(x>t) , (1.8) 

2. the state Xo at an initial time to were given: 

P(X,t 0 ) = S XXo . (1.9) 



Then for all later times t > to the state X'it) of the system would be determined 
in a unique way: 



P(X,t) = 8 X x'(t) • 



( 1 . 10 ) 



Here 



v X y 



1 if x = y 
0 otherwise 



(l.n) 



denotes the Kronecker symbol. 



1.2 Derivation 



Depending on the reason for the uncertainty concerning a system, the correct 
derivation of the corresponding master equation varies. The simplest derivation 
starts directly from the Markov assumption. In the case of external influ- 
ences (disturbances) the transition rates of the considered system depend on 
the probability distribution of the variables describing the external influences. 
For a system that consists of many elements, the fast variables can be adiaba- 
tically eliminated. Then the latter only appear in the form of (internally 
caused) fluctuations of the remaining slow variables. 
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In order to justify the master equation in an accurate way, we must consider 
the microscopic dynamics of a system. The description then starts from a 
quantum theoretical basis, namely with the statistical operator. A pro- 
jection on the relevant variables (with respect to the calculation of expected 
values) leads to the Nakajima-Zwanzig equation. In an appropriate basis 
it takes on the form of a (generalized) master equation [76, 306]. Again the 
fast variables in this master equation can be adiabatically eliminated so that a 
master equation for the relevant macroscopic state variables remains. 

Finally, a derivation of the master equation for chaotically behaving systems is 
possible by means of special mappings, namely so-called Markov partitions 
(cf. [201,45]). 



1.2.1 Derivation from the Markov Property 

Let P(X n , t n ;X„_i, t n _i; • • . ;Xo,to) be the joint probability to find the 
considered system in states Xo , . . . , X n at the times 

to, ti •— to "b At, ..., t n to + nAt . (1.12) 

Here let the times t,* be arbitrarily close to each other, i.e. At — ► 0. According 
to probability theory we have 



, t n -i; * • • ; Xq , to) = 1 

X n X n -i Xo 



due to normalization. Furthermore, 



P{X n ,t n ):= ^ . 


. . ^ ^ P(X n , t n ] X n —i, t n _i; . . . | Xo, to) 


X n - X 


Xo 



(1.13) 



(1.14) 



is the probability to find state X n at the time t n since (1.14) describes the sum 
over all probabilities to arrive in state X n . Moreover, there is 



P(X n , t n \ X n _i, tn — 1) • • • j Xq, to) 

= P(X n , t n |X n _i, t n -lj • • • | Xq, to)P(X n _i, t n - 1 ) . . . j Xq, to) 



(1.15) 



with 

^ ^ -P(X n , t n |X n — i, t n _i; . . . ; Xo, to) = 1 . (1.16) 

X n 

Thus the probability of a system to be in states Xo,...,X n at the 
times to,...,t n is given as a product of the probability to be in states 




1.2. Derivation 



25 



Xo, . . • , X n -i at the times to ) . . and the conditional probability 

P(X n) t n \X n -i, t n _i] . . . ; Xq, to) for a transition to state X n provided that 
the system is in states Xo, . . . ,X n _i at the times ti, . . . ,t n _i. 

Using the Markov assumption that the conditional probability depends 
only on the preceding moment we have 



P(X n ,t n \X n -i,t n -i ] . . .;Xo,$o) = P{X n , t n |X n _i , £n— 1 ) 



(1.17) 



By inserting (1.17) into (1.15) and using (1.14) we obtain 

P(x n ,t)= /»(x ni < B |jr„_i,< B _i)P(x B _ 1) <„_!). (1.18) 

*«-l 



Due to the normalization 

Y^P(X',t + At\X,t)= 1 

X' 

of the transition probabilities (cf. (1.16)) and with 

X :=X n , X':=X n - lt t := *„-i 

we get 

= ±J2P(X,t + At\X',t)P(X’,t) 



(1.19) 



( 1 . 20 ) 



A< 

1 

At 



P(X,t). (1.21) 



^(JO + A^X,*) 

. X ' J 

In the limit At — > 0 we arrive at the master equation 

f t P(X, t) = J2 t)P{X\ t) - w(X'\X; t)P(X, <)] (1.22a) 



with the transition rates 



w(X f \X] t) lim 

v 1 ’ At— ►o 



P{X\t + At\X,t) 
At 



(X'tX). 



(1.22b) 



For X' = X the transition rates w(X , \X;t) can be chosen arbitrarily because 
the relevant terms in (1.22a) compensate each other. Often it is convenient to 
put 



w(X\X]t) :=0. 



(1.23) 
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1.2.2 External Influences (Disturbances) 

We will now consider a system in which the transitions between two states X 
and X ' depend on unknown (e.g. external) quantities Qj. Let us assume that 
the temporal change of the external quantities is independent of the processes 
within the considered system, and let P(Q,t ) be the probability that we have 
the quantity Q = (..., Qj, . . .) tr at the time t. Then the effective transition 
rate of the system at the time t is 

w(X'\ X;t) := Y v(X', Q'\X, Q)P{Q , t) . (1.24) 

Q,Q' 

Because of the dependence on P(Q, t) ) the dynamics of the system can only be 
described in a stochastic way since, in general, it implies 

w{X'\X;t) £ 0 for various X f (1.25) 

even in the case that the transition rates w(X', Q'\X, Q) are deterministic. 
This is often the reason for a time- dependence of the effective transition 
rates w(X'\X] t). 



1.2.3 Internal Fluctuations 



Now let us start from a system the dynamics of which depends on extremely 
many internal variables. Let us assume that we are only interested in the va- 
riables X S} j £ {X/} but not in the variables Xf k € {^/}. By means of the 
decomposition 

X = (X t ,X s f (1.26) 

of the variable vector X and by use of the conditional probability P(Xf\X s ] t) 
we can achieve the representation 



P(X,t) = P(X u X B ;t) = P(X { \X s] t)P(X s ,t) 

with 

P(X f \X s -,t) = 1 and Yl p ( x s,t) = 1 

Xi x , 



(1.27) 

(1.28) 



(cf. (1.13) to (1.16)). Thus the master equation takes on the form 

j t P(Xf,X s ;t) = P(X a ,t)j t P(Xf\X s ;t) + P(X f \X s ;t)j t P(X s ,t) 



= Y wiX^X^X^PiX^X'^PiXlt) 

x>,xi 

- Y w(X' t ,X'.\X t ,X,;t)P(Xt\X.;t)P(X t ,t). (1.29) 
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Due to 

P(X s ,t) = J2P(Xf,X s ;t) (1.30) 

Xi 

(cf. (1.14)) we have the relation 



(1.31a) 



with the effective transition rates 



Xi x; 



(1.31b) 



Whereas w(X'\X;t) may be symmetrical (cf. (1.88)), w{X l s \X s \t) is normally 
asymmetrical because of the dependence on P(Xf\X s ]t). 

Now let us assume that X S) j are just the slowly changing macroscopic va- 
riables and Xf k the fast varying microscopic variables. From the mathe- 
matical point of view, fast variables and slow variables differ in the order 
of magnitude of the transition rates (which will, in the following, be assumed 
time-independent). Concretely, let 



w(X' { ,X' s \X u X m ) 



e(X' u X' s - X u X s )w(X' u X s \XuX s ) 



for X' s ± X s and 



This implies 






(1.32a) 



(1.32b) 



w(X' { ,X'.\Xt,X.) « w(X' t ,X.\Xt,X,)6x.xi • (1-33) 



Consequently, the right-hand side of equation (1.31a) vanishes and (1.29) takes 
on the form 

f t P(x { \x s -t) « 53u>(jr fl jr.|^,jr.)p(xj|jr.;0 

x ‘, 

- Y,iv(X't,X,\Xt,X.)P(X t \X.;t). (1.34) 



For P(X Sl t) not to be quasi-constant, a slower time scale must be chosen by 
means of the transformation 



t s := d . 



(1.35) 
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Then, with the notations 



Ps(X s ,t s ) := P(X a ,t a /e), (1.36a) 

Ps{Xf\X a ;t s ) := P(X t \X,;U/e) , (1.36b) 



and 



«, S M* S ;< S ) := 

= E ^ X ' S ’ ^ ^ w ( X 'f’ X *\ X <’ Xs)P*(X ( \X a ; t.) , (1.37) 

x t x\ e 

master equation (1.31a) reads 



-^P a (X a> t a ) = [ Ws (X s |X; ; < s )P s (^,< s ) - tflb(Xi|X.;<,)P.(X S) i.)] 

(1-38) 

in which the subscripts ‘s’ are normally suppressed. Note that (1.38) is consi- 
derably simplified with regard to the number of variables. 

Let e be of the order of magnitude of the average of the quantities 
c(Xf,Xs;Xf,X s ). Then the time scale t s corresponds to the average ‘speed 
of change’ of the slow variables X s whereas the fast variables Xf, com- 
pared to the time scale t s , relax quasi-adiabatically. Their distribution 
P(Xf\X s ;t s ) « P(Xf\X s ) is given by the slow variables X s , i.e. the fast 
variables are ‘slaved’ by the slow ones (cf. Ref. [107], pp. 204f) because it is 

j t P(Xf\X s -,t) = ^jLp s (Xt\X s ;t a ) = e-±-P a (X { \X s] t a ) « 0. (1.39) 

With (1.34) this implies 



0 « ^u>(X fl X.|Xj l X a )P 1 (Xj|X.;t.) 

x < 

- ^2w(X' ( ,X a \X { ,X a )P a (X f \X a ;t a ). 

x 'i 



Obviously, the conditional probabilities P s (Xf\X s ;t s ) are, due to the quasi- 
adiabatic relaxation, almost time-independent on the time scale t s . Equation 
(1.40) allows the determination of the conditional probabilities 



P s (X f \X s ;t s ) k P 8 (X { \X S ) 



(1.41) 
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which are necessary for the evaluation of the macroscopic transition rates 



w s (x;|x s ;< s )««; s (x;|x s ) 



(1.42) 



Due to the dependence of the transition rates u; s (Xg|X s ) on P(Xf|-X’ s ), the 
dynamics of the variables X s can again be described only in a stochastic way, 
even if the transition rates w(Xf ) X s \X , i) X , s ) are deterministic. Hence the va- 
riables X s are subject to fluctuations which stem from the quasi-adiabatically 
eliminated variables Xf. 



1.2.4 Derivation from Quantum Mechanics* 

The Statistical Operator 



Within the framework of non-relativistic quantum mechanics the microscopic 
dynamics of a system consisting of many subsystems is described by a sta- 
tistical operator (density matrix) p(t) which obeys the von Neumann 
equation [66,195] 




= 7tp-p7t 



(1.43) 



H is the so-called Hamilton operator of the system describing its energy. 



The statistical operator allows the evaluation of the expected value (mean 
value) (.4) of an observable (measurable quantity) 



.4 = £>x|X)(X| 

X 



(1.44) 



by means of 

(A) = Tt(Ap) = 52<y\Ap\Y) 

Y 

= £X>x(Y|X)(X|p|Y) = Y,Ax{X\p\X) 

Y X X 

— y ^A x Pxx • (1.45) 

x 

Here the states |X) were chosen in such a way that they form an orthonormal 
basis of the Hilbert space belonging to the total system and that they are 



This section can be omitted during a first reading. 
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the eigenvectors of the observable A to the eigenvalues Ax • Hence we have 



£l*X*l = J > 

X 



(1.46) 



(X|Y) = ^y| , 

and 

A\X)=A x \X). 



(1.47) 

(1.48) 



X denotes a vector that is normally composed of various quantum numbers. 



The matrix elements 

possess the properties 
and 

(cf. Ref. [66]). Hence 



p XY (t) := {X\p(t)\Y} 

Pxx(t) > 0 

£ Pxx (0 = 1 
x 

P(X,t) ~Pxx(t ) 



(1.49) 

(1.50) 

(1.51) 



(1.52) 



can be interpreted as the probability to find state |X) when observable A is 
measured. The formula (1.45) for the expected value (A) has thus the familiar 
form 

(1.53) 



(A) = '£A x P(X,t) 

X 



The Nakajima-Zwanzig Equation 



As relation (1.45) shows, the actually relevant part of the statistical operator 
p = (pxy) i s the diagonal part 

Pi : = (i Pxx s xy ) • (1-54) 

Therefore, from the von Neumann equation we will derive a reduced equa- 
tion for the smaller number of relevant variables. For that we proceed in the 
following way: 



First, we write the von Neumann equation in the Liouville form [171] 




(1.55a) 
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in which the linear tetradic operator (superoperator) £ = ( Lxyx'Y •) is 
given by 

(1.55b) 



Here B is an arbitrary operator which is defined in the Hilbert space of the 
total system. More explicitly, (1.55b) can be written in the way 



CB:= \{HB-BH) 

rt 



{£J3)xy = y^y ^Lxyx'Y'Bx'y 1 



with 

Lxyx'Y 1 = -^(Hxx'^yy 1 - Sxx'Hy’y) ■ 
For tetradic operators we have the multiplication rule 

{CM)xyx>y> = EE Lxyzz'Mzz'X'y 1 

z Z' 



and the representation 

( Dxyx'y ' = Sxx'fiYY' 
of the identity operator J_ (of unity). 

By means of the Laplace transformation [218] 



9(u) := J dte ui p(to + t) = J dte <0 ^p(t) 

0 to 

equation (1.55) can be converted into 

uG(u) - p(t 0 ) = -i CQ(u) 
which is formally solved by 

G(u) = («i+ i£) _ V(to) • 



(1.56a) 

(1.56b) 

(1.57) 

(1.58) 

(1.59) 

(1.60) 
(1.61) 



In order to separate out the diagonal part pi of the statistical operator p we 
decompose this into two parts 



p = pi ■+ P2 



(1.62a) 



with 



pi-Vj> 



(1.62b) 
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and 



Pi := Qp 



Q:=l-V 



Here the tetradic (projection) operator V_ is defined by 

PXYX'Y' := SxY&XX'&YY' 



(1.62c) 



(1.63) 



and fulfils the relations 



vv = v 


) 


VQ = 0 




QQ = Q 



(1.64) 



The decomposition of the statistical operator p into a relevant part pi and an 
irrelevant part p 2 also splits up the Laplace transform G(u) into two contri- 
butions 

G(u) = G 1 (u) + g 2 (u) (1.65a) 

where 

Gi(u) := VG(u ) , g 2 (u) := QG(u ) . (1.65b) 

Applying V and Q to (1.60), we obtain two equations: 



uGi(u) - Pi(to) = -iVCGi(u) - i VCG viu) (1.66a) 



and 

ug 2 {u) - p 2 (*o) = -iQC£i(u) - i QCG 2 {u) . (1.66b) 

Solving (1.66b) with respect to G 2 (u) we find 

02(tt) = (ul + iQC) -1 p2(to) - i(ui + iQC^^QCQiiu) . (1-67) 

Inserting this into (1.66a) yields 

uGi(u) - pi(t 0 ) = -i VCt 7 i (u) - i VC(ul + \QC\~ 1 p->(tn) 

-2L(ul + iQCr'QCg^u ) . ( 1 . 68 ) 



By means of the inverse Laplace transformation 




(1.69) 



we finally get the Nakajima-Zwanzig equation 

t 

j t Pi(t) = -\vcpi{t)~ j 

V2O0) 



-i VCe 



to 



(1.70) 
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by using the convolution theorem (cf. p. 295) and taking into account that 
the inverse Laplace transform of [ul_ + iB]~ l is exp[— i B(t — to)]. If the obser- 
vable A has been measured at the initial time to we may assume the initial 
statistical operator to possess a diagonal form (in the basis of the eigenstates 
of ,4), i.e. 



Pl(to) = p(to) 



and 



P2(io) — 0 



(1.71) 



The last term in (1.70) vanishes, then. There remains the equation 



j t Pi(t) = -i 2£pi(t) ~ J dt'VCe-^-^QCpiit') 
to 



(1.72) 



which is a closed equation for the relevant part pi(t). 



The Generalized Master Equation 

In order to obtain an explicit form of the Nakajima-Zwanzig equation we 
decompose the Hamilton operator 



W = £ Y, h *y\ x ){Y\ 



X Y 



into two parts: 

H = n 0 + ni. 

Choosing the so-called energy representation defined by 



n 0 :=^Hxx\X)(X\ 



, i.e. (Hq)xy = Hxx^xy , 



and 



Hi :=n-H 0 , 

the LlouviLLE-Operator splits up into two contributions 

C — C 0 + Ci 

with 

{Lo)xyx'Y' = ^('Ho)xx'^yy' - ^xx'('Ho)y , y^ 

= t(Hxx — Hyy)Sxx'Syy' 
n 



(1.73) 

(1.74a) 

(1.74b) 

(1.74c) 

(1.75a) 



(1.75b) 
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and ^ 

(£.1 )xyx'Y' = {('H^xx'^yy' — (>xx , ('Hi)y'Y 




(1.75c) 


Then we have 




Q£o — Q. — £o Q • 


(1.76) 


Besides, for an arbitrary Hamilton operators 7i we find the relation 


TCP = 0 


(1.77) 


because of 




(V£P)xyx'Y' = SxyLxxx'X'Sx'Y' 


(1.78) 


and (1.56b). Due to 




PCo i = VCPo . 


(1.79) 



the first term in (1.72) vanishes and there remains 

t 

■j t Pxx{t) = j dt' [£(< - t')pi(t')]xx 

to 

t 

= / dt ' y^,Kxxx'X'(t - t')px>x'(t') (1.80) 

J Y , 



with 

K(t) := -VCe~ i£0 QC 

= -VCQf'^^QCP. 

These conversions base on (1.64) and the Taylor representation 




Additionally, with 

^ Kxxx’X’jt) = - EEEffi* - '^' Q^XXYY’LyY’X'X 1 
X 1 X 1 Y Y 1 



and 



J2 Lyy 

X' 



'X'X' = K 



| y ^Hyx'Sy 

\ X' 



>x> 



y ^byx'Hx'Y' 

X' 



= 0 



we can prove the relation 



(1.81) 



(1.82) 



(1.83) 



(1.84) 



y ^Kxxx'x'jt) = 0 . 
x> 



(1.85) 
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This allows the representation 

Kxxxx(t) = - ^2 Kxxx'X'(t) = - ^2 Kx'X'xxtf) (1.86) 

x’(*x) x'&x) 

where the symmetry of /C according to (1.81) was used for the last conversion. 
Defining 

w t (X\X / ) := Kxxx>x'(t) 
as transition rates, we obtain 



( 1 . 88 ) 



and (1.80) takes on the form of the generalized master equation: 




(1.89) 



For Wt-t^X'lX), the assumption 

w t - t '(X'\X) ~ u;(X / |X)e" A( ‘"‘ ,) (1.90) 

often is a good approximation since, then, the influence of the past decreases 
exponentially with the distance \t — t'\ to the present. In the MARKOvian 
limit 1/A — ► 0 of ‘short memory’ we get 

wt- t '(X' \X) = w(X'\X)6(t f - t ) . (1.91) 

This implies that the generalized master equation turns into the ordinary ma- 
ster equation given by (1.5a) [156]. For reasons of clarity the next chapters 
exclusively treat ordinary master equations . The results for these can, without 
any complications, be transferred to the case of generalized master equations. 
Then we arrive at generalized Boltzmann equations [47], generalized 
Fokker-Planck equations [241] as well as generalized Boltzmann- 
Fokker-Planck, mean value, and covariance equations which differ 
from the corresponding ordinary equations only by an additional time integra- 
tion. This time integration allows the inclusion of ‘memory effects’. 



w t (X\X') = w t (X'\X) 



(1.87) 



Adiabatic Elimination of the Fast Variables 

The derived master equation contains an enourmous number of variables Xj 
of which only a few macroscopic variables X S) j £ {Xj} are of interest. Hence 
let us subdivide the variables {Xj} into variables {X S) j} and {Xf } k}- 

X = (Xf,X s ) tT . 



(1.92) 
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Now we look for an equation for 



P(X a ,t) :=£P(X f ,X s ;<) 

Xf 



(1.93) 



It is reasonable to assume that the macroscopic variables X S) j change slowly 
whereas Xf summarizes the fast changing microscopic variables. As a conse- 
quence, the variables Xf } k can, like in Section 1.2.3, be adiabatically eliminated 
so that a master equation for the macroscopic state vector X a remains. In con- 
trast to the microscopic transition rates w(X* \X) the macroscopic transition 
rates w s (Xg|X s ) are , due to (1.37), usually asymmetrical. For example, the 
forward and reverse reaction rate of a chemical reaction are in general 
different from each other (cf. Ref. [47]). 



1.3 Properties 



1.3.1 Normalization 



From 

= J2w(X\X';t)P(X',t)- 

X X t X’ X t X‘ 

= £ w{X\X'-,t)P{X\t)~ Y, w{X\X'-t)P{X',t) 

X,X' X‘ y X 

= 0 (1.94) 

we conclude that the total probability P(X,t) of the system to be in any 
of the states X E T is constant with time. Usually the probabilities P(X,t) 
are normalized to 1: 



!>(*>*) = i 



(1.95) 



1.3.2 Non-Negativity 

If condition P(X,to) > 0 is fulfilled at an initial time to for all states X, for 
all later times t > to we have 



P(X,t)> 0 



VX6f. 



(1.96) 
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In order to prove this let us assume that a point in time t 1 > to exists at which, 
for some state X " , the function P(X,t) changes its sign for the first time, i.e. 

P(X",t') = 0, 4 p (X" ,0 < o . (1.97) 

at 

Then, in contrast to this, we would find 



_d 

dt 



P(X",t') 



Y \w{X"\X'] t')P(X’, t') - w(X'\X";t') • 0 

X 1 



> 0 (1.98) 

because of w{X\X , \t) > 0 and P(X* ,t') > 0. 

Therefore, together with the normalization condition, the interpretation of 
P(X , t) as the probability of state X to occur at the time t is justified. 

1.3.3 The Liouville Representation 

Master equation (1.5) can also be written as a linear system of differential 
equations 

±P(X,t) = YLxx>(t)P(X',t ) (1.99) 

at x , 

or more briefly as a Liouville equation [171]: 






(1.100a) 



Here L = (Lxx 1 ) is the Liouville matrix with the components 



Lxx‘{t) := w(X\X f ;t) - 6 X x 1 ^2w(Y\X f ]t) 

Y 



(1.100b) 



and P(t) is a vector with the components P(X, t). In the continuous case (1.5b) 
the Kronecker symbol bxx> is to be replaced by Dirac’s delta function 
6(X- X') (cf. p. 301). 



Time- Independent Transition Rates 

In the following let the transition rates w(X , \X]t) be time-independent or of 
the form 



w{X'\X;t) := f(t)-w\X f \X). 



( 1 . 101 ) 
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In the latter case (1.5a) can, by time scaling 

( 1 . 102 ) 

be transformed into the master equation 

t) = £ [u/(X\X , )P'(X', r) - w'(X'\ X)P'(X, r)] (1.103) 

for P'(X,r) P(X,t(r)) with time-independent transition rates w^X' \X). 

This follows from 

dP{X,t) = dP{X,t{T)) dr = dP(X,t(r)) = dP’jX'T) 

dt dr dt dr ' dr ' * v • 

1.3.4 Eigenvalues 

By separation of variables, i.e. 

P(X, t) := P x (X)e X{t ~ to) , (1.105) 

(1.100) transforms into the eigenvalue equation 

LP X = A P x (1.106) 

where P x denotes the eigenvector to the generally complex eigenvalue A. It 
can be shown that the real parts of the eigenvalues A are non-positive (cf. Ref. 
[107], pp. 92ff): 

(1.107) 

If there were an eigenvalue A with Re(A) > 0, the corresponding solution would, 
in accordance with (1.105), exponentially grow with time and violate (1.95) 
from a certain time on. 

It is the stationary solution P(X,t) = P°(X) which is given by 
dP(X,t)/dt = 0 for all X and all times t > to that belongs to the eigenvalue 
A = 0. The existence of at least one stationary solution and, consequently, of 
the eigenvalue A = 0 is guaranteed since the rows of the Liouville matrix 
L = (Lxx 1 ) are linearly dependent (cf. p. 291) due to 

X X X Y 

= EuKXIXO-EuCYIX') 

X Y 

= o. (1.108) 

On certain conditions the stationary solution is also unique (cf. Sect. 1.3.5). 
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1.3.5 Convergence to the Stationary Solution 



In order to show that P(X,t) converges to a stationary solution P°(X) for 
time-independent transition rates, we introduce the entropy-like Liapunov 
function 



I<{t) :=£P(X,01n 

X 



P(x,t ) 
P'(X,t ) 



(1.109) 



Here let P(X,t) and P'(X,t) be two solutions of the master equation for 
dilferent initial distributions P(X,to) and P'(X,to). Due to 



£p(X,t) = l, ]Tp'(X,<) = 1, (1-110) 

X X 



and 

Inz < z — 1 



we immediately find 



*(0 = £*W) 

X 



• P(X,t) P'(X,t) 
. P'(X,t) ^ P(X,t) 



> 0. 



( 1 . 111 ) 



( 1 . 112 ) 



Moreover, we can derive 



djK 

dt 



dP(X,t) ( ln P(X,t) A dP'(X,t) P(X,t) 



dt y“p'(x,t)'y dt p'(x,t) 

= ( ln ^(l ^ + 1 

- (w{X\X')P'(X',t) - w(X'\ X)P'(X,t))P^ 

= EE W ( X I X')P(X',t)(lnz-z+l) 

X X 1 

< 0 (1.113) 



with 



z = z(X,X , ;0:= 



P(X,t) P'(X',t ) 
P’(X,t) P(X',t ) 



(1.114) 



Since K(t) decreases monotonically due to (1.113) and has a lower bound bec- 
ause of (1.112), K(t) approaches a limit value for t — ► oo (cf. Ref. [207], p. 32) 
which is characterized by 



(1.115) 
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(1.115) is fulfilled if and only if z(X,X'\t ) = 1 holds for all pairs (X,X ; ) for 
which w(X |X') ^ 0. Now, by presupposing w(X\X') / 0 for all (X, X') with 
X ^ X' and choosing for P'(X,£) a stationary solution P°(X), we get 



P(X,t) P(X‘,t) 
P°(X) P°(X') 



const. = 1 . 



(1.116) 



Therefore , every solution P(X,t) of the master equation converges to the sta- 
tionary solution P°(X) for t — ► oo. In particular , this means that P°(X) is 
the only existing stationary solution. 



1.4 Solution Methods 

1.4.1 Stationary Solution and Detailed Balance 

In order to evaluate the stationary solution we must solve the linear system of 
equations 



0 = Y, L XX‘P°(X') 

X' 

= ]T [w(X\X')P°(X') - W (X / |X)P°(X)] . 

X ' 



If, in addition to this, the condition 



w(X\X')P°(X') = w(X'\X)P°(X) 



v x\x g r 



(1.117) 



(1.118) 



is fulfilled (i.e. the flows from states X' into states X and the inverse flows 
from states X into states X' cancel out each other separately, not only in the 
sum), we call this a situation of detailed balance. In such a case we directly 
obtain as the stationary solution 



P°(X n ) = P°(X 0 )II 
* = 1 



wiXi^Xi) 



P°(X 0 ) 



w{C n ) 

wiCn 1 ) 



(1.119) 



where 

C n :=X o Xi — X n (1.120) 

is a path from Xo to X = X n with non-vanishing transition rates w(X{ |X,_i) 
and u;(Xi_i|Xi) (i = 1, . . . , n). C” 1 denotes the inverse path 



C- 1 := X 



X^X 0 



( 1 . 121 ) 
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of C n . Moreover, for n > 1 the following abbreviations were used: 

n 

w(C n ) := ft «;(*,- IX,-!) , (1.122a) 

1 = 1 
n 

w(C~ l ) := II^«-il^»)- (1.122b) 

Z = 1 

P°(X o) is obtained from the normalization condition (1.95). The result is 

(1.123) 

Obviously, for detailed balance the relation 

(1.124) 

must be fulfilled for all cycles 

C n := X - X l - . . . -> X n _i - X . (1.125) 

Conversely, detailed balance holds if condition (1.124) is satisfied for all cycles 
C n . Since the cycles C n can be decomposed into elementary cycles, it is 
sufficient to prove (1.124) for these. (Cf. Ref. [294], pp. 328ff.) 

An additional property of detailed balance is that the eigenvalues of the corre- 
sponding Liouville operator L are all real and thus non-positive (cf. (1.107) 
and [107], pp. 92ff). This is the case because the eigenvalue equation 

J2Lxx'P\X') = XP x (X) (1.126) 

X' 





can, with 



be written as 



MX) 




Y,L S XX'MX') = XMX). 
X' 



(1.127a) 

(1.127b) 





- 5>(V|X') 

y 

(1.127c) 



Here 
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is a symmetrical (self-adjoint) matrix, i.e. its eigenvalues are real (cf. Ref. 
[20], pp. 235ff, and [165], pp. 255ff). The symmetry results from 



w(X \X f ) 



y/P°(X') 

y/P%X) 



= tu(X|X') 



P°(X’) s/P°(X) 

P°(X) v^X 7 ) 



by using the condition of detailed balance. 



w(X'|X) 



ypyn 

s/p°(x') 

(1.128) 



If detailed balance is not fulfilled, there exist only limited possibilities for an 
explicit solution of the master equation. Worth mentioning are the transi- 
tion factor method [111], the Kirchhoff method [63,249,287], and a 
few others (cf. e.g. Ref. [104]). 



1.4.2 Time-Dependent Solution 

According to the Liouville representation the general time- dependent solu- 
tion of the master equation has the form 

P(X, t) = P x {X)e~ X(t ~ to) (1.129) 

A 

if the eigenvalues A of 

L= (l*x'(t)) = (u;(X|X / ;0-^X'^^(V|X , ;l)) (1.130) 

y 

are not degenerate. Here P x (X) is the eigenfunction of L to the eigenvalue A. 
If degenerate (multiple) eigenvalues occur, the solution is of the form 

P(X,t) = J2 £ ^(*)^(0e _A(t ~‘ o) (1.131) 

A z=0 

where n\ is the degree of degeneration (the multiplicity) of the eigenvalue 
A and Pi(t) is a polynom in t of degree i (cf. Refs. [28,203]). 

Formally the general solution of (1.100) reads 
P(t) = e L(t ~ to) P{t 0 ) 

= ( 1 .132) 

k=0 

and is thus uniquely determined by the initial distribution P(^o)- In the case 
of time- dependent transition rates we have 

P(X, t) = Y ] P(X, t\X 0 , to)P(Xo, to) , 



(1.133) 
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i.e. 

P(t) = P(t,t 0 )P(t 0 ) . 


(1.134a) 


Here the propagator P(£, £q) = ^P(X, t\Xo, to)^ must fulfil the conditions 


j t P{tM)=L{t)-P{tM) 


(1.134b) 


and 




R(to,to) = 1 


(1.134c) 


with the unity matrix 1 = • The formal solution 


is (as one can confirm 



by applying the second fundamental theorem of calculus) 

P_(t } to) = T exp t ) toj (1.135a) 

where 




is the time-ordered exponential function. Unfortunately, this relation is 
not of great use because it can hardly be analysed explicitly. For a numerical 
solution of the master equation often Monte-Carlo methods are applied 
(‘stochastic simulation 9 ) [23]. 

1.4.3 ‘Path Integral’ Solution 

Path Probabilities 

In the following we will introduce a new solution method for the discrete master 
equation which allows, in contrast to the methods that were discussed up to 
now, the calculation of path or process dependent quantities. For its derivation 
we start from equation 

P(X,t + At) = P(X,t) + At ^ w{X\X' -,t)P{X' ,t) 

X'(jtX) 

- At ]T w{X'\ X;t)P(X,t) 

X'(jiX) 

= £>(*,< + A t\X',t)P(X',t) 



(1.136a) 
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which is equivalent to master equation (1.5a) in the limit At — » 0. Here the 
quantities 



P(X,t + At\X',t) = 6 XX ,+At £ [w(X\Y-t)6 x . Y 

Y&X) 

- 6 xx .At J2 ™(Y\X;t) (1.136b) 

Y(*X) 

have the meaning of transition probabilities (cf. (1.21), (1.22)) which define 
the propagator P(t + A t,t) = ^P(X,£ + At|X ; ,t)^ (cf. pp. 42f). Obviously, 
the solution of (1.136) is 



'W>=,1“ E £•••£ 


n 


P(X 0> t 0 ) 


Xn-i X n _ 2 Xo 


.1 = 1 





(1.137) 

where X n := X, t{ := to 4* i At, and At := (t — to)/n. This representation is 
the basis of Haken’s approximate path integral solution [105]. 



In the following we restrict ourselves to the case of time-independent transition 
rates w(X'jX;t) = w(X'jX) and try to derive a contracted form of (1.137). 
In order to achieve this simplification we utilize the circumstance that the 
described system normally does not change its state X,* over many time periods 
At. This means that within a time interval 



T:=t-t 0 (1.138) 

the system will change its state only for a finite number of times. So let Xo, 
X i , . . . , X n be the sequence of states in which the system is found between 
times to and t. 

C n :=X o - . . . - X n _! - X n (1.139) 

with X n = X will be called the path of the system. Furthermore, 

P(Cn,T) = P(X o X n -i — ► X„,T) (1.140) 

will be denoted as the (path) probability with which the system takes the 
path C n during the time interval T. Then the probability P(X, t) = P(X, to + 
T ) to find state X at the time t can be expressed as a sum over all path 
probabilities: 



oo 

P(X,t 0 +T) = '£ ■ 


.. £ P(X o^...^Xn-l^X,T) 


n=0 Xn _ 1 (jt X ) 


Xo&Xi) 



(1.141) 
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This can be abbreviated by 

oo 

p(x,< 0 + :r) = ££p(c n ,T). (1.142) 

n=0 C n 



The path integral solution (1.141) of the discrete master equation is the desired 
contraction of formula (1.137). It was suggested by Empacher [63]. In order 
to calculate the explicit form of the path probabilities P(C n ,T) we will extend 
an idea of Weidlich (cf. Ref. [289], Sect. 7.1). For this purpose, we write the 
master equation, after introducing the matrices 



and 



w= (wxx>) ■■= 



with 



W X x := 0 



(1.143a) 



D= (jDxx'^j '■= (wjcfox') 



with 



Wx := £ «;(y|X) , 

Y(*X) 



(1.143b) 



in the vectorial form 

j t P(t) = (W-D)P(t) (1.144) 

(cf. (1.100)). Wx is the total departure rate from state X, i.e. the total rate 
of transitions to all other states. By applying the Laplace transformation 



we get 
i.e. 



oo oo 

G(u) := J dte~ u{t ~* o) P(t) = j dTe~ uT P(t 0 +T) 

to 0 

uG(u) - P(t 0 ) = (W- D)G(u) , 



(1.145) 

(1.146) 



G(u) — (wl — W + D)* 1 P(to) ■ (1-147) 

The desired result is finally obtained by the inverse Laplace transformation 



c+ioo 

P(t) = P(to+T) - / due uT G(u) (1.148) 

27T1 J 

C—'lOO 



of G(u). This yields 

— c+ioo 

P(t 0 + T ) = -^ / dut~ uT {D-u\-W)- l P(t 0 ) 

Z7T1 J 
— c-ioo 

— c+ioo 
-c-ioo 



(1.149) 
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The geometric series 

oo 

[1 - (D - ul)" 1 ^]" 1 = (1.150) 

n=0 

will be convergent if the arbitrary (but fixed) constant c is suitably chosen. 
Now, with 

[(22 - til )~ l mxx' = YiWx - u)-H xy w{Y \X') = y |X/) (1.151) 

y Wx - U 

and (1.143a), we find 

OO 

E E {[(£ - r'm n (D - m 

n=0 Xo 

= E E - E (1152a) 

"= OX^&X) XofrXi) f[(W Xj - U) 

j= o 

where 



if n = 0 



w(C n ) = w(X 0 



JJ^ilXj-i) ifn> 1. 

a J= 1 

(1.152b) 

Inserting (1.152) into (1.149) and comparing the result with (1.141) we gain 
the explicit expression for the path probabilities P(C n ,T). It reads 



p {Cn>T) = — j du — — w(Cn)P(X 0 ,to) • (1153) 

— c— ioo JJ(Wx, - u) 



We can evaluate this integral by using the residue theorem (cf. [266], pp. 
112ff). If, for example, all rates Wx { are equal (Wxi = Wx), we obtain 



P(C n ,T ) = - '-e- WxT w(C n )P(Xo,to ) 

n\ 



(1.154a) 



If, however, all rates Wx, are different, we get 



P(C n ,T) = E~ 



-w Xi T 



v(C n )P(X 0 ,to) • 



.=o J! (W Xj -W Xi ) 



(1.154b) 
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Obviously, the expressions (1.154) grow poly nomi ally in T for T & 0 and de- 
crease exponentially for T — * oo. 



Path Occurence Times and Their Probability Distribution 

In numerical evaluations of formula (1.141) by means of computers we certainly 
cannot sum up over infinitely many different paths. The summation has rather 
to be limited to those paths which provide the most significant contributions. 
In order to find out which paths are negligible we will calculate the probability 
distribution 

(1.155) 



of the occurence times 2 T of a path C n given that the system traverses this 
path some time. Here 

oo 

P(C n ) := J dT P(C n ,T) (1.156) 

0 

is a normalization factor which reflects the probability with which the path C n 
is traversed at all. By inserting (1.153) into (1.155) we find that the occurence 
time distribution P(T\C n ) depends only on the traversed states Xo, . . . , X n 
but not on their order. 

From the occurence time distribution P(T\C n ) we expect that it will become 
maximal after a certain time interval T: For T ^ T there is too little time 
for traversing the n states. In contrast, for T T the probability is high that 
already further states X n +i, X n + 2 , . . . were traversed. More precisely, we have 

P(T\C n ) T -^ C\{C n )T n (1.157a) 

and 

P(T\C n ) 7 C 2 (C n )e~ WxT (1.157b) 

which can be derived from (1.154). Here Ci(C n ), C 2 (C n ) are constants and Wx 
is the smallest of the rates {Wx 0 , . . . , Wx n } which were assumed to be different 
from each other. 

For the calculation of the mean value (T)c n and variance 

:= ((T- (T)c n )\ = (T 2 ) Cn ~ (<T) Cn ) 2 (1.158) 

2 Each time t = to + T at which the path C n can be found is understood as an occurence 
time T of the respective path. Consequently, the occurence times of a path C n must be 
distinguished from the waiting time which means the time period until the path C n appears 
for the first time. 



P(T\Cn) := 



P(C n ,T ) 
P(Cn) 
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of the occurence times T of path C n we require the quantities 



oo 

(T k ) Cn := J dTT k P(T\C n ). 



(1.159) 



By means of the residue theorem (cf. [266], pp. 112ff) we find 

co oo — c+ioo —uT 

J dTT k P(C n ,T) = J dTT k E j du — — w(C n )P(X 0 , <o) 

o 0 — c— ioo — u) 



i r (*Y fv,- 

-<L f[{W x ,-u, ' ' l 



= w(C n )P(Xo,to)-^r J du ^ 

— c— ioo u *+i f[(W x , - u) 



- w(C n )P(Xo,to) ^ 



Thus we obtain 



P(C„) = «;(C„)P(Xo ) <o)n7^ 



J " L Wx 

»=o 



(1.160) 



(1.161) 



oo 

(T)c n := J dT TP(T\C„) = E ^ , (1162) 



OO n 

„c. :=jdr(T- f r>c.)VlC ») = E 



(1.163) 



These results can also be derived from (1.154b) by means of the formulas 



dT T*e 



k~Wx,T _ 



(W x ,) k+1 



(1.164) 
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and 



n 



E 



1 

ft (W X} - W Xt ) 

3 = 0 

(J*0 



= 0 



(1.165) 



Assuming that the rates Wx { are equal ( Wx t 

relative width 

x/^cT _ 1 

(T)c n \/n + 1 



Wx) we recognize that the 
(1.166) 



of distribution P(T\C n ) becomes smaller with increasing number n of transiti- 
ons. 



When calculating approximate solutions for P(X,t) = P(X,to + T), only 
such paths C n will be taken into account which fulfil the condition 



I T-(T) Cn \<a 




(1.167) 



with the accuracy parameter a. If, for example, we choose a = 3, about 
99% of the distribution function P(T|C n ) will lie between T\ := (T)c n 
and T 2 ( T)c n + &\Jvc n - As a consequence, for any given time t = to + T 
those paths which satisfy condition (1.167) will reconstruct about 99% of the 
distribution function P(X ) to + T) defined by formula (1.141). 



Applications 

Compared to other solution methods the advantage of the path integral solu- 
tion is that it allows to determine process or path dependent quantities. For 
example, we can calculate the probability with which the considered system 
takes ‘desired’ paths that consist of selected states only. In the same way we 
can evaluate the probability with which the system visits an unwanted state 
in the course of time and thus possibly takes a ‘catastrophic’ path. These 
probabilities are of special interest for prognoses (e.g. scenario techniques) 
or for the control of the temporal behaviour of a technical or other system 
(process control). 

Another interesting application area is the event analysis (survival analy- 
sis) [51,275]. Here one investigates the distribution of waiting times until 
a certain event happens, i.e. a certain state is reached, and the paths taken. 
In the context of social sciences the events (states) describe personal circum- 
stances. By means of (1.141) and (1.154) it is now possible to carry out even 
complex event analyses for which the distinction of many different states is 
necessary or for which the states are not traversed in a given order (but pos- 
sibly several times). For the description of the event histories of individuals, 
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however, one often requires time- dependent transition rates w(X f \X]t). Ne- 
vertheless, the mathematical results of this section can be expanded to such 
cases if the time-dependence of the transition rates has the special form (1.101). 



1.5 Mean Value and Covariance Equations 

In most cases the master equation is used for the delineation of the dynamics 
of the individual elements of a system and thus corresponds to a microscopic 
level of description. However, not only because the calculation of the pro- 
bability distribution P{X ) t) is very laborious one is often merely interested in 
the mean value and covariance equations which are derivable from the master 
equation. These reflect the collective behaviour of the elements and, therefore, 
are also called macroscopic equations. Insofar, the master equation allows 
a microscopic foundation of the macroscopic dynamics. 

Having defined the mean value (f(X,t)) of a function f(X,t) as usual by 
(f)t = ( f(X , <)) := £ f(X, t)P(X, t) , (1.168) 

X 

we directly obtain, from the master equation (1.5a), the mean value equati- 
ons 

d(Xj) _ dP{X,t) 

dt x 1 dt 

= £L [x lW (X \X'\ t)P(X', t ) - XMX'IX] t)P(X, <)] (1.169) 

X X 1 

= -Xi)w(X'\X;t)P(X,t) (1.170) 

X X' 

= ^mjpr.o/W), (1.171) 

X 

i.e. 

(1.172) 

In the derivation of (1.172) we interchanged the order of X and X 1 from (1.169) 
to (1.170) and introduced the quantities 

with A X'j—X'j-X! (1.173) 
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which are called the first jump moments 3 (cf. Ref. [151], pp. 130fF). In com- 
plete analogy, for the covariances 



<r,j = <ru{t) := ((X/ - (X/) t )(X., - (Xj) t )) t = (XjXj) - <Xj)(X,) , 

(L174) 

we find the covariance equations 



^ = (mu(X,t)) + ((X, - <Xj))m,(X,f)) + {(Xj - {Xj)) mi (X ,t)) 

(Ll75; 



with the second jump moments 



mjj(X, t) := 2^AX;AX>(X'|X;<) 

X' 



(1.176) 



(cf. Ref. [151], pp. 130ff). 

Equations (1.172) and (1.175) are no closed equations because they depend 
on the unknown mean values (mi(X,t)) and (mu(X , £)) respectively. Howe- 
ver, if the distribution P{X ) t) is sharply peaked , i.e. as long as all variances 
cr// are small , a first order Taylor approximation can be carried out. Then, 
with 



(f(X,t)) « (/((X), <) + £(*./ - = f{{x)>t) (L177) 



and 

((X, -<*/»/(*.*)) 

* (i x i - (Xj))f((X),t) + (Xj - (XjV^Xk - ( x K)) df Qlxly -) 

(1.178) 



\ IK d < X K) 



closed equations result, namely the approximate mean value equations 



at ~ 



m i({X), t) 



(1.179) 



3 Some authors denote the jump moments as derivate moments or transition 
moments. 
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and the approximate covariance equations 



d<ru 

at 



mjj((X),t) + 




dmj((X),t ) 
d(X K ) 



+ VJK 



dmj((X),t) \ 

d(X K ) ) ' 



(1.180) 

The assumption of small variances is often justified since in many cases the 
state Xq at a initial time to is known by measurement or preparation, i.e. 



P(X ) t 0 ) = Sxx o • 

Consequently, the variances are zero at the initial time to and remain small 
at least for a certain time interval which can be determined from the appro- 
ximate covariance equations (cf. Sect. 10.4.1). A considerable increase of the 
covariances in the course of time normally indicates a phase transition, i.e. 
a sudden change of the macroscopic system properties (cf. Sect. 10.4.1). 

In the one- dimensional case the formal solution of the approximate variance 
equation can be written as 



<*4. 

Ill 

3 

II 

o 


[^d)(Wt)i 

ki)(Wto)J 


2 (*)« , 

+ Ki)(P00] 2 / dx' m(2)( *° 3 
m,. [”><■><*'>] 


(1.181) 



where ( X) t satisfies the approximate mean value equation 






(1.182) 



(cf. Ref. [103], p. 74). Here m (i) denotes the first and m( 2 ) the second jump 
moment. 






2 BoLTZMANN-Like Equations 



2.1 Introduction 



In the following we will assume to be confronted with systems that consist 
of N elements (subsystems) a. If the dynamics of the system is mainly 
given by pair interactions of the elements (like e.g. in a rarefied gas) it is 
plausible to use Boltzmann equations for its description. A typical example 
of application is a chemical reaction 

B + C^BC (2.1) 

where the different states x a E {B, C} distinguish whether an atom a is boun- 
ded or not (cf. Refs. [11, 167]). Apart from pair interactions, the more general 
; BoLTZMANN-like equations 9 additionally take into account spontaneous 
transitions and are the main topic of Chapters 7 and 8. Interrelations and 
differences between spontaneous transitions and direct pair interactions will be 
discussed in detail in Section 2.5. 

The Boltzmann equation, like the master equation, has a structure of the 
form (1.1). However, the flows of the Boltzmann equation are proportional to 
the probabilities of the states of both interacting subsystems. They depend on 
the square of the probability distribution and thus the solution is difficult. In 
the case of the gaskinetic Boltzmann equation the stationary solution can 
be found by means of the /7-theorem. An evaluation of the time- dependent 
solution is usually very complicated and bases, in most cases, on a recursive 
method. 

The derivation of the BoLTZMANN-like equations can be subdivided into the 
following steps: First, we combine the states 

X a = (27al) • • • > #a*> . . . , X am ) E Joe (2*2) 

of all N elements a in the state vector 

X = (...,JC/,...) tr .ajv)* 1, (2.3) 

of the total system which has M components (M := miV, I := (a,i) tr ). 
y a means the set of all possible states x of element a and will be denoted 
as 7 a -space. For reasons of simplicity we have assumed by (2.2) that the state 
space y a has the same dimension m for each subsystem. This is no restric- 
tion since, if a system can be characterized by a smaller number m a < m of 
variables, the surplus components x a j with m a < j <m can be set to zero. 
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Next we confine ourselves to spontaneous transitions and pair interactions, i.e. 
simultaneaous interactions between three or more subsystems will be neglected 
(though a treatment of these is also possible; cf. Sect. 3.4 and Refs. [40,41, 
277]). The resulting equations can be reduced to equations for the probabi- 
lity distributions P a (x a ,t) of the subsystems a but they depend on the pair 
distribution functions P a p(x ai xp\ t). In order to obtain closed equations we 
assume the factorization of the pair distribution functions (i.e. the statistical 
independence of the elements). Corrections for cases where this assumption 
is not justified are the topic of Section 3.4. 

Finally, if we distinguish only a few types a of elements, the number of equa- 
tions reduces drastically. The result are the BoLTZMANN-like equations. 



2.2 Derivation 



First, for the derivation of the BoLTZMANN-like equations from master equation 
(1.5) we introduce the vectors 

X a :=(0, 0,...,0) tr , (2.4) 

k 

X ai - ak :=]Tx ai , (2.5) 

* = 1 

and 

Y ai ... ak :=X- X ai - ak + Y ai - ak . (2.6) 

They allow the decomposition of the transition rates 



w(X'\X;t) 



' w ai (Y a '\X a '-,t) 
w aia2 (Y aia3 \X aia3 ;t) 



< 



«v...«„(v ai -“" \x a >- 



if X' = Y, 



a l 



if X' = Y ai 



at 



;<) if X' =Y ai ... 



C*N 



0 



otherwise 



(2.7) 



into contributions 

w ai ... ak (Y ai - ak \X ai - ak ; t) = w ai ... ak (y ai ,...,y ak \x ai ,...,x ak ;t) (2.8) 

which describe the interactions between k subsytems. In order to exclude self- 
interactions of an element we set 



w ai ... afc =0 if two subscripts a,-, aj agree. 



(2.9) 
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For most chemical reactions [11, 167] and many other cases the contributions 
with k < 2 are the most important ones because simultaneous interactions 
between three or more elements are rare. Hence the terms with k > 2 will be 
neglected in the following. Then master equation (1.5a) obtains the form 

P Vp 

+ I EE U7(^ 7 l^ 7 i0W7,0-^7(^ 7 l^ 7 ;0^.<)' 

/?,7 1 >, 

= E£[ W 0 (x/}iVp;t)P(Y 0 ,t) - wp(yp\xp; t)P(X, f)] 

P Vf> 

Pn yp>y-r 

-wpy(y 0 ,yy\ x fhX J ]t)P(X,t)j . (2.10) 

The factor 1/2 results from 

E =h E < 211 > 

' ai, - 

(ori<. ..<«*) (ai^aj) 

and ensures that identical transitions will not be counted twice. (There 
exist fc! permutations (interchanges) (/?!,...,/?*) of the k elements E 

{ai,...,ajfe}). 

Now let us introduce the multi-particle distribution functions 

(2.12a) 

for the k particles aq, . . . , a* and introduce the notation 

Pi.n (*i , ...,x N ;t) = P(x j ,...,x N ;t). (2.12b) 

The summation of (2.10) over all E 7/? with /? ^ a (using relations which 
are analogous to (1.94)) leads to 

— Pfx^Xa^t) — 'y ] ^u> a (x a ||/ a ; t)P Qf(l/ a ^) ~ ^(l/al^a? fyP ot( x ay pj 

+ E EEE“ a p(x a ,xp\y a , yp; t)P a p(y a , y p \t) 

y<* Ip yp xp 
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-£££» apiVa’ Vpfaot} X P > ^)Pap(x a ,Xp] 0 
P y» */> 



(2.13) 



with 

£/»«(*«,*) = 1- ( 214 ) 

X a 

Due to the dependence on the pair distribution functions P a p( x an x p\ t) 
equations (2.13) are not closed. The equations for the pair distribution func- 
tions which can be derived from (2.10), however, depend on the three-particle 
distribution functions P«/? 7 (»«,*£, a j 7 ;£) etc. As a consequence, a hierarchy 
of non-closed equations (cf. Ref. [170], pp. 74ff) results so that an approxi- 
mation becomes necessary. Assuming a factorization of the pair distribution 
functions of the form 



P otpfeon x p\ t) £3 P a(Xon t)P d( x P > 0 



(2.15) 



we arrive at the closed equations 
d 



dj.P<x( x ont) — ^ v fy Potty cn t)-™ a (v a \ x a'l t)P a (x 

an O] 
y<* 

+ £ £££ w a p(x a , */j| y a >yp'i t)Pp{vpyt) J Pa(y a > 0 

IA p VD Xfi ) 

- £££- otptyon Vp\ x on x p’^)Pp( x P > 0 j Pa( x on f) 

V P Vfi / 

(2.16) 



y« 



(cf. Refs. [170], pp. 72fF, and [25,93,94,162,224,240]). ttv(y a |a5 a ; t) describes 
spontaneous or externally induced transitions of element a from state 
x a to state y a . w (X p(y ai yp\x ai xp]t) reflects pair interactions between two 
systems a and /? due to which their states change from x a and xp to y a and 
yp respectively. Introducing the effective transition rates 

»«(*>; 0:=m*'M) + £££» ap{x',y'\x,y\t)Pp{y,t), (2.17a) 

p y> y 

equation (2.16) can be cast into the simple form 

^Pa(x,t) = '^2 ^“(*1*'; t)P a (x', f) — w a (x'\x; t)P a (x, f)j . (2.17b) 



Factorization assumption (2.15) is exactly valid only if two arbitrary sub- 
systems a and /? are statistically independent. It is approximately justified 




2.3. Subdivision into Several Types of Subystems 
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if the interactions are weak (w a p « 0) or if the correlations between two sub- 
systems a and /? due to pair interactions are soon destroyed (cf. also (3.20)). 
Equation (2.15) is often substantiated by the assumption of ‘molecular chaos’ 
[154] (cf. Sect. 8.3.4). For the case that (2.15) is violated there exist methods for 
the calculation of corrections (cf. Refs. [25,54]). Moreover, a generalization of 
the above equations to simultaneous interactions of arbitrarily many elements 
is possible (cf. Sect. 3.4). 



2.3 Subdivision into Several Types of Suby- 
stems 



Usually the immense number N of subsystems a can be divided into A types a 
which are characterized by certain properties, e.g. by the kind of interaction. 
In example (2.1) one type of subsystems would consist of the molecules of sort 
B and a second one of the molecules of sort C. 

Now let N a be the number of elements a E a (i.e. of type a). Then we have 



a=l 



(2.18) 



If subsystems a E a of the same type a are not further distinguished this implies 



and 



7a 


= 7 a 


if 


a € a, 


(2.19a) 


Pc 


= Pa 


if 


a £ a , 


(2.19b) 


W a 


= Wa 


if 


a G a , 


(2.19c) 



U>a(3 



Wab if OL i=- /?, OL e a, P G b 
0 if a = P 



(2.19d) 



where w .. are the individual transition rates. Furthermore, after introducing 
the notations 

(2.20a) 




and 



Nb • Wab if b / a 

(N a - 1) • Wab if b = a, 



Wab := 



(2.20b) 
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from (2.14) and (2.17) we gain 



!>(».*) = i 



and 



^P a (*, t) = ^ [w a (x\x'] t)P a (x', t ) - w a (x'\x] t)P a (x, <)] 

X 1 



( 2 . 21 ) 



(2.22a) 



with the effective transition rates 



W a (x'\x ; t ) := w a (x'\x ; t) + ^ ^ ^ tM*'- y'\x, y, t)P b (y, t) 

b y' y 



(2.22b) 



By distinguishing only a few types of subsystems the number of variables is 
considerably reduced if A <C N or N a >• 1. Thus (2.22) becomes accessible to 
a treatment by computers. 



2.4 Properties 

2.4.1 Non-Negativity and Normalization 

If the condition P a (a;, £o) > 0 is fulfilled at a certain time to for all states a, the 
non-negativity defined by P a (x,t) > 0 will be guaranteed for all later times 
t > to. Moreover, despite the applied factorization assumption (2.15) we can 
derive 



^ ^2 P ^( X ^)J = 0 i i e - X) = const ( 2 - 23 ) 

which allows the normalization of distribution P a (x,t). The corresponding 
proofs are analogous to those in Sections 1.3.1 and 1.3.2. 



2.4.2 The Gaskinetic Boitzmann Equation 

The Boltzmann equation for fairly rare gases is a special case of equations 
(2.22). It results, if one combines in the state 



x := (r,v) 



(or y := (s,t»)) 



(2.24) 
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the place r (or s) and the velocity v (or w) of a gas particle. By types a we 
distinguish different sorts of particles. The spontaneous state changes are 
given by the motion 

(2.25) 

and the acceleration 

( 2 . 

where F a (t) is the force exerted on a particle of mass m a . They are described 
by the transition rates 





v a (r', v'\ r, v ; t) Jj m 0 ^ 6 




(2.27) 

In the Kramers-Moyal representation (cf. Sect. 4) they contribute to 
equations (2.22) with the divergence terms 

(2.28) 

Additionally, state changes result from scattering processes of particles (from 
direct pair interactions). These are described by the transition rates 

(2.29) 

which reflect the collisions of two particles at location r = s and the resulting 
velocity changes. 





With respect to the densities 



\p(a ) r,v;t) := N a P a (r,v]t ) 



(2.30) 



where N a denotes the number of particles of sort a within the considered volume 
V, we have the gaskinetic Boltzmann equation [26, 154, 163] 



—p(a,r,v;t) 

[i r f m 

vp(a,r,v;t) - V„ — —p(a,r,v;t) 

J L m a 

+ Jd 3 v'Jd 3 wJ d 3 w' wl lb (v ) w\v f ) w , )p(a ) r i v , ]t)p(b ) r ) w']t) 
— ^2 J j J d 3 w , w! lb (v , y w , \v,w)p(a,r 1 v]t)p(b,r,w;t). 



(2.31) 
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In most cases the pair interaction rates w' ab are expressed by the differential 
scattering cross section 



d?<r' ah = dP<T' ai (v' ,w‘\v,v>) 



w , ab {v , ,w , \v,w)d z v'd^w' 

II* — 'HI 



Furthermore we have 



Kbi v > w W,V>') = w' ai (v', w'\v,w) 
since the pair interaction rates w a \, are symmetrical (cf. (1.88)): 
Wab{x, y\x', y') = Wab(x', y'|x, y) . 



(2.32) 



(2.33) 



(2.34) 



(This symmetry is a consequence of the time reversal invariance and the iso- 
tropy of space; cf. Ref. [154], pp. 69ff.) Consequently, the gaskinetic Boltz- 
mann equation (2.31) can be written in the form 

j t p(a,r,v,t) = ^2 Jd 3 wJd 6 a^ b \\v -w\\\p(a,r,v'-,t)p(b,r,w';t) 

—p(a, r, v; t)p(b, r,w; <)] . (2.35) 



Here 



d d r l 

—p( a ,r,v; t ) := g}P(a, r, v; t) + V r - [i >p(a,r,v; t)j + V„ • 



*•.(<) 



m„ 



p(a,r,v,t)\ 



~ r,v;t) + v- V r p(a, r, v; t ) + • V„p(a, r , v; t) 

ot m a 



(2.36) 

is the total (substantial) time derivative which describes the temporal 
change of the density p(a, r , v\ t) in a transformed coordinate system which 
moves along with velocity (v,F a /m a ) tT . Obviously, dp(a,r,v;t)/dt is only gi- 
ven by the scattering processes. In this sense, Boltzmann equation (2.35) is 
already determined by pair interactions. In contrast to this, the terms due to 
spontaneous transitions usually cannot be eliminated from the general equa- 
tions (2.22): In the case of non- deterministic transition rates w a (x*\x;t) the 
Kramers-Moyal expansion leads not only to divergence terms but also to 
further contributions which cannot be eliminated by a transformation. 



In the following, equations (2.22) will be called ‘Boltzmann equations 9 if 
these are solely given by pair interactions. If, in addition to pair interactions, 
spontaneous transitions play a role, it is reasonable to speak of ‘Boltzmann- 
like equations 9 . 
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2.4.3 The /^-Theorem for the Gaskinetic Boltzmann 
Equation 

For the gaskinetic Boltzmann equation (2.31) there exists, analogous to Sec- 
tion 1.3.5, a Liapunov function 



H(t) J d 6 xp(a,x;t)lnp(a,x-t) 



(2.37) 



with the property 




(2.38) 



Again, this causes the system to approach an equilibrium. Except for an af- 
fine transformation, H(t) is identical with Boltzmann’s entropy function 



S(t) := -k 



^2 J d?x P(a, x; t) log 2 P(a, x; t) 



(2.39) 



with 



P(a,x;t) := ^P a (x,t) = 



(2.40) 



For the proof of (2.38) we first show 
j6_ \dp(a,x-,t) 



f - E/a 



dp(a,x,t ) 



F a (t) 



P( 



a,*;t)|) 



dt 

= -^2 j <fx\np(a,x\t) jv r |v/?(a,x;t)] + V„ 

+ ^2 j J j j \np{a,x\t)\^w ah {x,y\x\yf)p{a,x'\t)p{b,y'\t) 

a,b 

- W ai (x', y'\x, y)p(a, x; t)p(b, y; <)] d 6 x d 6 x' d 6 y d 6 y' (2.41) 

= ~^2 J d 6 x ^V r [v(p(a,x-,t)lnp(a,x;t) - p(a,x,t)^ 

+ V* (/>(a, x; t ) In />(a, x; <) - p(a , x; <)) j j 

^/// / ln ^ a,x; ^ 

a,b 

- w a b(x', y'\x, y)p(a, x; t)p(b, y; <)] <f 6 x dV d 6 yd 6 y ' . 



+ 



(2.42) 
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The term 




dp{a,x\t) 

dt 



12 ft f d6x P( a ’ x ’ t ) = Yl 

a J a 



dNg 

dt 



dN 

!k 



(2.43) 



vanishes on the condition that the considered system is a closed system which 
implies a conservation of the number of particles. The divergence terms 
can be transformed into surface integrals by using Gauss’ divergence theo- 
rem (cf. (4.16)). Due to 



lim p(a,r,v;t) = 0, (2.44) 

IMI— °° 

however, these provide no contributions. Thus the following remains: 

J j J J ln / , ( a > x ; < )[« 5 ai(x,y|a! / ,y / )/ > ( a . a: '; < Mfr,y';0 

a y b 

- w ab (x', y'\x, y)p(a, x;t)p(b, y; <)] d 6 x d 6 x' d 6 y d 6 y' (2.45) 

= 12 JJ JJ [ln/t>(a,*;<)-ln/)(a,a!';0] 

a,b 

xw a b(x, y\x', y')p{a, x'; t)p(b, y', t)d 6 x d 6 x' d 6 yd 6 j/ (2.46) 

= ^12 //// [ ln / > ( a > x ; 0- ln / > ( a . a;/ ;0 + ln / , ( 6 . y ; 0 - ln / > ( 6 . y '; 0 ] 

xw a b(x, y\x' , y')p(a, t)p(b, y'\ t)d 6 x dV dPyd^y' . (2.47) 

From (2.45) we come to (2.46) by renaming the variables of the second term in 
accordance with x ^ x 1 and y <-+ y' . The identity of (2.46) and (2.47) results 
by interchanging the variables of two interacting particles a, /? and taking into 
account 



m a (y, x\y', x') = wp a (y, x\y', x') = w a p(x, y\x' , y') = w ab (x , y\x' , y ') . 

(2.48) 

On the condition 



j d*x' J <*y w ab (x, y\x', y') = J d 6 x' J d 6 i / w ab {x' , y'\x, y) 



(2.49) 



which is fulfilled due to (2.34) we further have 

\EJJJJ w ab {x,y\x' ,y')\j)(a,x'-,t)p(b,y-,t) 

~ P( a , x;t)p(b, y; i)j d 6 x dV cfy d 6 y’ 
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= \j2jjJJ \wab{x,y\x' ,y')p{a,x'\t)p{b,y'\t) 

- w a b (*', y'\x, y)p{a, x; t)p{b, y; <)j d 6 x d 6 x' d 6 y d 6 y' 

= 0 . (2.50) 



This can immediately be seen by renaming the variables in the second term in 
accordance with x «-»■ x' and 

Now, formally adding in the vanishing term (2.50) to (2.47) and introducing 
the abbreviations 



z a b — z a j(x, y; x , y j t) : — 



p( a , x'\ t)p(b, y 1 ] t) 
p(a, x; t)p(b, y; t) 



z 'ab = Zab(v,w;v',w'-,t) := 



p(a,r,v';t)p(b,r,w';t ) 
p(a,r,v;t)p(b,r,w,t ) 



we find the relation 



(2.51a) 

(2.51b) 



dH 

dt 



\ZJJJJ p(a,x'>t)p(b,y,t)(z ah lnz ai - Zab + l) 

a,b 



xw a b{x,y\x' ,y')d^x d 6 x' d 6 yd 6 y l 



■ -&/////*- v ; »•. w; 0 (^aJ In + l) 

a, 6 

tn Iv', w f )d 3 v d 3 v 1 d 3 w d 3 w f d 3 r 

< 0 (2.52) 

because of 

Zab ln z ab - Zab + 1> 0 for > 0 . (2.53) 



Consequently, a system described by the gaskinetic Boltzmann equation 
(2.35) moves towards an equilibrium. However, without the validity of re- 
lation (2.49) a Boltzmann equation can also show an oscillatory or chaotic 
behaviour (cf. Sects. 8.3.2 and 8.3.4). 



2.4.4 Solution of the Gaskinetic Boltzmann Equation 

According to (2.52), the relation dH/dt = 0 presupposes that the equilibrium 
condition z' ab = 1 is fulfilled. This implies the condition 



+ M w ) “ ^a(v') + i>b(w') 



(2.54a) 
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with 




ip a (v) :=ln p(a,r,v). 


(2.54b) 


Obviously, (2.54) is satisfied by the collisional invariants 




mass m a , 


(2.55a) 


momentum m a Vi , 


(2.55b) 


m a „ „2 m a \2 

energy — ||v|| =^2Jvi) , 


(2.55c) 



and linear combinations of these. Inserting 

i>a(v) := c\m a + cl ti m a Vi + ^(v,) 2 (2.56) 

* * 

into (2.54) and representing the density of particles of sort a at location r by 
p{a , r) we obtain 

(2.57a) 

with the Maxwell-Boltzmann distribution 




P a (v) := 






) (25rtr a ) 



-(v-(v)«) 3 /(2<7«) 




(2.57b) 

Here ( v) a has the meaning of the mean velocity of the particles of sort a. a a is 
related to their absolute temperature T' a via the Boltzmann constant k: 

(T a = k T' a . (2.58) 

For the determination of the time- dependent solution of the gaskinetic Boltz- 
mann equation there exists a (rather complicated) recursive method which was 
developed by Chapman [38] and Enskog [64] (cf. Ref. [170], pp. 187fF). 



2.5 Comparison of Spontaneous Transitions 
and Direct Interactions 



In the following we will elaborate the interrelations and differences between 
spontaneous transitions and direct interactions. 
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2.5,1 Transitions Induced by Interactions 

Let us assume that we have A' = A + B types a of elements and that the 
elements of all types are able to change their states only by pair interactions. 
Provided the dynamics of all A! types is known, it is described by the Boltz- 
mann equations 

d A ' 

— P a (x,t) = ^^^^u; oi ,(* ) y|*',y , ;<)P 6 (y / ,<)' P a(* , ,0 

6=1 x 1 y y‘ 

A' 

- ^J2Y^J2 Wai ( x '’ y 'i x ’ i/;t ) Pi ( y ’ i ) Pa ( x ’ i ) ( 2 - 59 ) 

6=1 x* y y 1 

(a G + B}). If, however, only the dynamics of the A types a G 

{1, . . . , A} is known, it obeys the BoLTZMANN-like equations 

-^P a (x, i) = X) pH*'; 0 - w a (*'|a;; t)P a (x , <)] (2.60a) 



w a (x'\x\ t) := w a {x'\x\ t) + Y Y w at( x '>y'\ x ’ y; t)Pb(y> 0 • (2.60b) 

i = i y y' 

The connection with (2.59) is given by 

A+B 

w a (x'\ *;<):= Y Y,'52 Wai ( x ' ' y> l*>y;0^(y.0- (2.60c) 

6=A-f-l y y 1 

So the transition rates w a may describe transitions which are induced by 
unknown pair interactions. 

2.5.2 Exponential Function und Logistic Equation 

Let us consider the case of one type a of subsystems (A = 1) which can change 
between 5 = 2 states x = x G {1, 2}. Then there is 

P(2,t) = l-P(l,t) (2.61) 

where subscript a was omitted. If only spontaneous transitions occur and 
the transition rates are time-independent, the corresponding BoLTZMANN-like 
equation is the master equation 

^P(M) = W (1|2)P(2,*)-M2|1)P(1,<) 

= w(l|2) — [ti>(l|2) + ti>(2|l)]p(l,<). 



(2.62) 
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2. BoLTZMANN-Like Equations 



Its solution is the exponential function 



P(M) = P°(l) + P(Mo)-P°(l) 



-A(t-to) 



F°(l) := 



w(l|2) 



u;(l|2) + u>(2|l) 



< 1 



(2.63a) 
(2.63b) 

A:=ti>(l|2) + t»(2|l). (2.63c) 

If, in contrast to this, only direct pair interactions take place and if the tran- 
sition rates are again time-independent, the corresponding BoLTZMANN-like 
equation is the Boltzmann equation 



with 



and 



dt 



P(l,t) = [u)(l, 1|2, 2)P(2, t)P(2, t) + «;(!, 2|2, 2)P(2, t)P(2, f)J 



+ 



- [w(2, 1|1)2)P(2,<)P(1,<) + w(2, 2|1, 2)P(2,t)P(l,t)j 
«(!, 1|2, l)P(l,t)P(2,t) + w(l, 2|2, l)P(M)P(2,t)] 
w( 2, 1|1, 1)P(1, 0^(1, t) + w( 2, 2|1, 1)P(1, t)P( 1, 0] 

= -C' 1 [P(1 ) <)] 2 + C' 2 P(1,<) + C 3 (2.64) 

where Ci, C2, and C3 are constants that are determined by the pair interaction 
rates w together with (2.61). After introducing 



Q(l,t) := P(l,t)-C and C := Ca + 

we can transform (2.64) into the logistic equation 



dt Q(l,t) - rQ(l,t) (l Q q 0 ^ 



with 

and 



(2.65) 

( 2 . 66 ) 

(2.67a) 

(2.67b) 

( 2 . 68 ) 



The exponential solution of (2.62) and the logistic solution of (2.64) are com- 
pared with each other in Figure 2.1. 



r :=C 2 - 2 CiC = \/(C 2 ) 2 + 4C 1 C 3 

FT 

Ci 



o m ._ r _ v/(C 2 ) 2 + 4C 1 C 3 



e»d) » s = 



(2.66) has the solution 
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2.5.3 Stationary and Oscillatory Solutions 

In the case of exclusively spontaneous transitions with time-independent tran- 
sition rates (w a (x , \x]t) = ^(x'l*)) the BoLTZMANN-like equations for the 
temporal evolution of the distributions P a (x,t) have the form of master equa- 
tions (cf. (2.62)). According to the discussion in Section 1.3.5, P a (a;, t) converges 
in the course of time to a unique stationary equilibrium distribution P® (x). 

In contrast to this, there exist Boltzmann equations for pair interactions 
which have various stationary solutions, oscillatory solutions, or chaotic 
solutions even in the case of time-independent pair interaction rates (cf. Sects. 
8.3.2 and 8.3.4). However, this presupposes that condition (2.49) is violated so 
that the //-theorem does not hold. 




3 Master Equation in 
Configuration Space 



3.1 Introduction 



In many cases we are not interested in the states x a of the single elements 
a of a system but only in the number n % of subsystems of type a which are 
in state * (e.g. the number of molecules of sort B which are in a bounded or 
an unbounded state). Since the states of the subsystems a change stochasti- 
cally , the occupation numbers change stochastically, too. Consequently, 
the occupation numbers must be described by a probability distribution. Its 
temporal evolution is given by a master equation in configuration space (the 
space of occupation numbers). When considering only spontaneous state chan- 
ges we find, as approximate mean value equation of the configurational 
master equation, the master equation in state space from Chapter 1. Taking 
into account also pair interactions, as approximate mean value equations we 
obtain BoLTZMANN-like equations, instead. A generalization to interactions 
of higher order (between three or more systems) is possible without any pro- 
blems. Furthermore, the configurational master equation allows a calculation 
of corrections of the approximate mean value and covariance equations. 

The covariance equations are important for three reasons: First, they reflect 
the stochastic nature of the temporal changes and distinguish different systems 
with the same approximate mean value equations. Second, they are necessary 
for the determination of the reliability of prognoses on the basis of mean 
value equations and for the evaluation of the time interval after which the 
approximate mean value equations become useless. Third, they facilitate cor- 
rections to the approximate mean value equations. This will be illustrated in 
Section 10.4.1 by concrete examples. 

The configurational master equation admits the introduction of occupation 
number dependent transition rates which are necessary for the description of 
indirect interactions. Such models are especially useful for mean field ap- 
proaches or the description of social processes (cf. Sects. 8.2 and 10.4). 

Finally, the differences and interrelations between direct and indirect interac- 
tions will be elaborated in particular. 
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3. Master Equation in Configuration Space 



3.2 Transitions in Configuration Space 



The following considerations take up the discussion from Chapter 2 about sy- 
stems which consist of many elements a of a few types a. It will be assumed 
that the subsystems can be only in one of S states x £ y a := {xi ) . . . ,«s} 
though a generalization to infinite or continuous state spaces y a is also possi- 
ble. In addition, the following relations can, by exchanging S for 5 a , easily be 
extended to the case of state spaces y a which consist of different numbers S a 
of states. 

The vector 



n := . . ,,n^ s ) tr = (n 1 , . . 



tr 



(3.1) 



of the occupation numbers will be denoted as the configuration of the 
total system and represents the distribution of the elements over the states. 
Then the configuration space T f consists of the vectors n which fulfil the 
relation 



Ke{0, (3.2) 



Here N a is the number of subsystems of type a. Since we will neglect birth 
and death processes of subsystems in the following, N a is time independent. 
(For a discussion of birth and death processes cf. [84], pp. 262fF, and [292], pp. 
86ff.) 



-Y2 n *i = Na 

X t' = l 



3.2, 1 Spontaneous Transitions 

If we restrict ourselves temporarily to the consideration of spontaneous tran- 
sitions (so that interactions between the elements are excluded: w a }> = 0), the 
single subsystems can in general be regarded as statistically independent of 
each other. Then the probability distribution P(n, £) of the configuration n is 
given by 

A 

P(n,t) = l[P a (n a ,t) (3.3a) 

a—\ 

with 

n“ n% s r (3.3b) 

and the multinomial distribution [81,82] 



\P a {n\t) := 



N a \ 



| , 



rtd | 

X S * 



P a {xi,t) n *i ■ ...■ P a (x S ,t) n *s 



(3.3c) 




3.2. Transitions in Configuration Space 
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P(n, t ) is the probability with which a certain distribution of the elements over 
the states occurs. Evaluating the time derivative of (3.3) and using (2.22), we 
obtain for the temporal change of P(n, £) a master equation in configura- 
tion space (configurational master equation). It reads 



— P(n, t) — ^ w(n\n'\ t)P(n\ t) — w(n'\n\ t)P(n, £)j 



(3.4a) 



with 



w(n'\n\t) 



and 



P(n } t) > 0 



s'|*;0 if n' = 


(3.4b) 


otherwise , 


• > ( n x — !)>•••> n x 1 ’ ■ ■ 


•>O tr . (3-5) 


£ F ( n ’*) = 1 




(3.6) 



According to this, the configurational transition rate n x w a (x , \x; t) = 
n x w a (x f \x; t) is, on the one hand, proportional to the number n a x of the sub- 
systems which can leave state *. On the other hand, it is proportional to the 

individual transition rate w a (x'\x\ t). 



3.2.2 Pair Interactions 



In order to consider, apart from spontaneous transitions, also pair interactions 
we apply the method from Section 2.2: We decompose the total transition rate 
u^n'ln; t) into contributions which describe interactions between k subsystems. 
Afterwards we again neglect interactions of higher order (A: > 2). Then master 
equation (3.4) becomes 



!«»■'> = £ 1 

a, a;, a' , t ' 


w(n\n£“;t)P(n“',“,t) - |n; t)P(n, t) 




+ 5 £ 








'b,yy,y' 








-w{n a J, b y, a x b y | 


n]t)P(n,t)} 


(3.7) 


with 


.•,«: + i): 






Jl 

55 


,...,K-i),...) tr 


(3.8a) 


and 












.,K' + l),...,(nJ-l),...) tr . 


(3.8b) 
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If the type a of an element does not change (or at least very slowly), the 
transition rates will normally have the form 

w ( n x'x\ n 'i 0 : = ^aa i w a (x , \x;t)nl = S aa iw a (x , \x; t)n% (3.9a) 

and 

w ( n i'y'xl l n ;0 : = baa>hb'Wab{x\yf\x,y\t)n a x n h y 

Na & 1 S aa i8 w Wab(x' ,y'\x,y,t)nl^- , (3.9b) 

Mb 



i.e. 



w(n / \n; t) 



f w a (x'\xit)nZ if n' = n a x , a x 

< ^ab{x\yf\x,y\t)n%n h y if n! = n^ b y ,f y 
0 otherwise. 



(3.10) 



Here the configurational transition rate due to spontaneous transitions is again 
proportional to the individual transition rate w a (x f \x;t) and to the number 
of the elements which can leave state x. In contrast, the configurational 
transition rate which goes back to pair interactions is, on the one hand, pro- 
portional to the individual pair interaction rate w a b(x', y'\x, y\ t). On the 
other hand, it is proportional to the number n%n h y of possible pair interactions 
between elements of type a which are in state x and elements of type b which 
are in state y. Therefore, in order to be precise, in the case n f = '^•y'xx 
the configurational transition rate w aa (x f , y)\x, x\ t)n“n“ must be replaced by 
w aa (x' , y f t) n x( n x ~ 1)^ Otherwise we would admit pair interactions of an 

element with itself (which was explicitly excluded by (2.9)). Nevertheless, the 
form (3.10) of the configurational transition rates can be used as an approxi- 
mation if 



• w5aa(* / j y'\&, *5 0 <C w a (x*\x;t) is fulfilled or 
y' 

• n a x 1 is valid for all configurations n for which P(n, t ) is not negligible. 



Whereas the first case mainly applies to systems with weak internal interactions 
(w a b & 0) the second case is mostly given for systems which consist of many 
elements ( N ^ 1). In the following the validity of approximation (3.10) will 
always be presupposed. 



3.3 Mean Value and Covariance Equations 

With respect to the configurational master equation, the mean value and co- 
variance equations once again are of special interest. In analogy to (1.168), the 
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mean value (/(n,i)) of a function f(n,t) is defined by 

{fit = </(». 0) : = S 0 • 



(3.11) 



Due to the formal relation between the master equation (1.5) in state space and 
the configurational master equation (3.4), the mean value equations have 
the form 



-«> = K(tm)> 



(3.12) 



with the first jump moments 

m“(n,<) := ^(An')Xn'|n; <) 

n' 



& y,y' 

b y,y‘ c z,z ' 

Since the individual pair interaction rates are independent of the arbitrary 
numbering a of the elements, we have 

Wba{y' ,x'\y,x;t) = Wf} a (y',x'\y,x-,t) 

= w a p(x',y'\x,y;t) = w ab (x' ,y'\x,y,t) (3.14) 

and an explicit evaluation of (3.13) leads (after a substitution of y by *') to 

(3.15a) 



m x (n, 0 = ^2 w a {x\x'-, n; t)n a x , - w a (x' |x; n; t)n“ 



with the effective transition rates 



w a (x'\x;n;t) := w a (x'\x\ t) + ^ ^ ^ w ab (x', y'\x, y; <)n* 

b y‘ y 

N *= x w a (x'\x] t) + ^2 ^2^2 Wab{x', y'\x, y, t)^- . 

b y' y b 



Hence, the approximate mean value equations 

d K) 



(3.15b) 



dt N a 



”£((»)>*) 



(3.16) 
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can be written in the form 

(3. 

with the mean transition rates 



17b) 



(cf. Ref. [150]). For N a >> 1 the approximate equations for the temporal change 
of the expected proportions ( n“)t/N a agree with the BOLTZMANN-like equa- 
tions (2.22), i.e. we have 

(3.17c) 

and the mean transition rates (3.17b) correspond to the effective transition 
rates (2.22b): 

(3.17d) 

In the following we will always assume N a 1, «.e. the existence of a system 
that consists of many elements . 

The mean value equations (3.17) are exact equations if the configurational 
transition rates w(n'\n;t) depend linearly on the occupation numbers (if in- 
teractions do not play any role). Otherwise they presuppose the validity of the 
condition 

( n >l) * «)(”£> ( 318 ) 

which obviously corresponds to factorization assumption (2.15). As a con- 
sequence, the applicablility of the mean value equations assumes the absolute 
values of the covariances 

<1 = («£"*') - «)("*') (3-19) 







to be small so that the condition 

(3.20) 

is fulfilled. (3.20) must be checked only for the variances a™ because the 
Cauchy- Schwarz inequality (cf. p. 304) together with the triangle in- 
equality (cf. p. 304) imply the relation 





(3.21) 
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In any case we require covariance equations. Analogously to (1.180), the 
approximate covariance equations read 



dt 



»”*«'((»»). 0 + E 

c,y 



_ac , 6 c ^mg((n),Q \ 

. ** 0(n$> + 0(n$) ) 



Here 



m“ b x ,(n,t) := ^(An')£(An')*,u;(n'|n; <) 

n> 

c y,y' 

+ ^ **"!«;*) ( 3 - 23 ) 

c y ,y 7 d z,z' 

are the second jump moments which have the explicit form 

m S *<( n »0 

= $<,4 jtar' ^ [u;°(*| y;n;t)n“ + w a (y\x;n;t)n“ 

-[u> 0 (x|x'; »;<)«£» + w a (x'\x;n;t)n“ ) 

+ Y E [™ a *( X > X 'l*/- y'> O n y n y' + Wai(y, VI*, <)«>*' 

y' y 

- YY, [«'«»(y. SB/ l a! iV , ;0 n S n 5' + w a 6(x,y'|y,x';0 n » n t' 



(3.24a) 



y' y 



(3.24b) 



An elegant derivation of the mean value and covariance equations is also pos- 
sible by means of the operators A%, % which are defined by 



A a x >J(n,t) := f(A&n,t) := /(»&,*) 



(3.25) 



(cf. Refs. [294,289]). This bases on the following representation of configura- 
tional master equation (3.7): 

-^P(n,<) = Y [«>H Ki'A)HKx'A) ~ 



a.x.a'x' 



+ \ E E 

-w(n“;y“*|n;<)P(n,<)] 



a,x,a' ,x' fc > y > 6 / ,y , 

^a' b' ab \ 




76 



3. Master Equation in Configuration Space 



= 2 (a™ 1 - l^w(n°',“\n;t)P(n,t) 

a,r,a , J ar / 

+ \ £ £ (^''45'' -i)^ 

a,x } a' ) x l b^yb^y' 

Moreover, the moments 



•••••(<)'*) (3-27) 

can be calculated via the characteristic function (Fourier transform) 



F( u>; t) := ^ P(n; £) exp(iu> • n) 

n 

and the generating function (Laplace transform) 

n a,* 

This methods utilize the relations 
/, j ,,i , A . t A\ 1 d 'i d 1 * ( . 

and 

(«)'! ■ • «)'«) = • • • (o* 4?T GiV ' l) 



(3.28) 



(3.29) 



^;=...=ui^=o 



(3.30) 



u}=...=ug = 1 



(3.31) 



3.4 Corrections and Higher Order Interactions 



According to Section 3.3, the exact equations 

^ = «(-,<)) (3.32) 

for the first moments (n£) (i.e. the mean value equations) depend, because 
of relations (3.15), on the second moments (n“n y ). However, the exact equa- 
tions 






= ( m L'( n > 0) + (n“m*,(», <)) + (n b x ,m°(n, t)) 



dt 



(3.33) 
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for the temporal evolution of the second moments (n x n b x ,) depend, due to the 
relations (3.24), on the third moments {n x n b x ,n x „) etc. Like in Section 2.2, 
we face the problem that the exact moment equations represent a hierarchy 
of non- closed equations. Hence suitable approximations for the moment 
equations must be found. 



For the discussion of this problem let us consider the general case that interac- 
tions between up to k elements have to be taken into account. Then in master 
equation (3.4) we must insert the transition rates 



w(n\n\ t) 



where 



W ai \ 


(*il*i;<)«Si 




if n' 


— n ai a i 

- n x' l Xi 


w aia 2 { x \ > x 2 1*1 1 *2> t) n x\ 


n x 2 


if n' 


— n a i a 2 ai a 2 
x , 1 x , 2 xix 2 


w ai . 


• • • • i ** 1*1) • • • 




) 






x<; •••••< 




if n' 


II 

3 

H ’fc 
^ ?r 


0 






otherwise 



(3.34) 



n 



ai ...Ofc ai ...afc 

x\ -Xk 






ak 



Consequently, the temporal evolutions of the Ith moments ( n x \ • . . 
the equations 



(3.35) 
•<!) obey 




= EE [(»')£ • • • • • («')£ -<:••••• <]«>(«>; <)p(n,<) 

n n' 

= EE [(( A «')s; + <;) • • • • • + <:) 

n n ' 



-n„ 



w(n'\n;t)P(n, t) 



(3.36) 



(cf. (1.170)), i.e. 



(3.37) 



Here 



:::*£(»,*) := ^(An , )r‘; • . . . ■ (AnOS^^n'In;*) 



(3.38) 
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are the mth jump moments and Ym means summation over all subsets 
{iij...,z m } consisting of m elements of the set {1,...,/} = If 

the transition rates u;(n'|n;^) = u;(n'|n) are time-independent , the moments 
are expected to approach a unique solution in the limit t — ► oo since P(n, t) 
converges to a unique equilibrium distribution (cf. Sect. 1.3.5 and Fig. 3.1). 

Due to (3.34) and (An')£ G {0,±1, . . . ,±fc}, the mth jump moments are po- 
lynomials of order k in the occupation numbers n£**. Thus equations (3.37) 
for the /th moments depend on the first up to the (/ — 1 + Ar)th moments. Con- 
sequently, the exact moment equations (3.37) will represent a closed system 
of equations only if k = 1 (if interactions between elements can be neglec- 
ted). In the case of interactions between k > 1 subsystems we must choose an 
appropriate approximation of (3.37) or of 

<[■■■■■ <: 

= [(< - «;>) + «;>] • • • • • [(« 2 : - «:)) + {<:)] (3-39) 

= • • • • • 
m=0 Mm 

<*•») 

(3.40) is the exact Taylor expansion of (3.39) (cf. [270] and [100], pp. 340ff). 
If we are interested in the first up to the jth moments (1 < l <j, j >1) there 
are essentially two ways of getting approximate moment equations: 



a) The substitution of all jump moments ra*** by Taylor approxima- 
tions of first order: 



+ X>S-te» 



d 



a,x 



d(K) 



m x 



i 1 ■••*»» 



:((»>.<)• 



(3.41) 



In this case the equations for the /th moments depend only on the first up 
to the /th moments but each equation is an approximation. This method 
was applied in Sections 1.5 and 3.3. A comparison with (3.40) shows that 
it corresponds to the assumption 



l(("Si -Ki:»)l 



< 1 



for 1 < m < k 



(3.42) 

which can be checked by evaluating the first up to the fcth moments. 
Hence one should choose j > k. 
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b) The substitution of all expressions • 71 **™+* • • n*-|j by 

Taylor approximations of jth order: In this case the equations for the 
first up to the (j — k -f 1 )th moments are exact (cf. (3.40)). However, 
the equation for an arbitrary /th moment depends on the first up to 
the (/ — 1 -f k ) th moments the higher ones of which are determined by 
approximate equations. Method b) corresponds to the assumption 



[((»*■! - (»£'>)) • . . ■ • (nS‘> - (nS‘>)))| 

( n *il )■•••' ) 



<1 



for j < m < l — 1 + k 



( 3 - 43 ) 

with 1 < / < j. (3.43) is fulfilled for a certain time interval which, in 
contrast to method a), cannot be determined by solving the approximate 
moment equations. Nevertheless, (3.43) is normally guaranteed for large 
m if the absolute values of the covariances are small in the sense of 



<r a xx , <e«)(rv) 



with 


0 < £ < 1 







(3.44) 



(which is often the case in the thermodynamic limit N 1). Then, 
where P(n, t) is not negligible, the absolute value of 



>*H> 



- 1 



Tlx 



- 1 



(3.45) 



is normally small since 



— )) ■ • 


• ~ ( n *im)))l m/2 


W,}-- 





(3.46) 



For j = 1 (i.e. a restriction to mean value equations) methods a) and b) lead 
to the same approximate equations. However, as soon as we are also interested 
in moments of higher order (j > 2) we can expect better results from method 
b) than from a) because method b) bases on an approximation of higher order 
(cf. Fig. 3.1 and Sect. 10.4.1). 

For k = 2 (i.e. including pair interactions) and j = 2, for example, method b) 
would provide the corrected mean value equations 



d{<) 

dt 






6,2/ c.y 7 



be d 2 m$((n) t t) 
yy‘ d(n>)d( n ;,) 



(3.47) 



(cf. Ref. [151], pp. 130ff) which, due to (3.15), contain terms of the thus form 
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0.00 1 I 0.00 1 - — ■ — — ■ - I 

0.00 0.15 0.90 0.45 0.60 0.00 0.15 0.90 0.15 0.60 



<n t >/N <n 1 >/N 

Figure 3.1: Left: Phase portrait of a periodic solution of a special approximate 
mean value equation (cf. Sect. 8.3.2). 

Right: The solution of the corresponding corrected mean value equation moves 
in a spiral way towards a stationary point as expected according to Section 
1.3.5. 



(n%n b x i) and depend on the covariances a x b x , = ( n x n b x ,) — ( n x )(n b x ,). The cor- 
rected covariance equations according to method b) were given by 



~ m ab <7«\ t\j--X'X'* ed d 2m St'(( n M ) 
~ ^ yy ' d{n‘)d(n d y ,) 

T L e H((")V jt c gmg((n),Q \ 
+ 2 ^ ( a *v f)/ n c\ +<T *'t/ 



(3.48) 



and would use (3.24) as well as the approximations 

* <<>K)K 3 3 )+^:s 2 2 (^>+^;^k)+^s 3 3 k i 1 > • (3.49) 



3.5 Indirect Interactions and Mean Field Ap- 
proaches 

In cases of indirect interactions not only the configurational transition rates 
but also the individual transition or interaction rates depend on the occupation 
numbers n x of the states x. Restricting ourselves to spontaneous transitions 
and pair interactions (k < 2), we must insert into the master equation 

— P(n, £) = ^ ju^nln'; t)P(n ' , t) — w(n'\n] t)P(n , t) j (3.50a) 

n‘ 




3.6. Comparison of Direct and Indirect Interactions 
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the generalized transition rates 





f 

w a {x'\x\n\t)n% 


<3 H 

si 

II 


w(n f \n; t ) = < 


w a b(x' ,y'\x,y]n\t)n a x n\ 


if n f — n a ^ a ^ 

ii n — n x , y , xy 




0 

k. 


otherwise. 



(3.50b) 



A derivation of the corresponding approximate and corrected mean value and 
covariance equations is directly possible with the Taylor approximations from 
Section 3.4 (cf. Sect. 10.4). 

The generalization to occupation number dependent transition rates is parti- 
cularly important for the description of social processes where the behaviour 
of the considered individuals can depend on the (so cio) configuration n [292, 
289]. However, it is also relevant for mean field approaches [24] which are 
applied in physics (e.g. in the Ising model of ferromagnetism [10,148,304]). 
Mean field approaches essentially correspond to the approximate mean value 
equations (3.16) or their stationary solution. 



3.6 Comparison of Direct and Indirect Inter- 
actions 



In the following we will elaborate the interrelations and differences between 
direct and indirect interactions in more detail. 



3.6.1 Differences Concerning the Covariance Equations 

First, let us compare the mean value and covariance equations of two diffe- 
rent configurational master equations: In the first case the transition rates are 
assumed to have the form 

«'a»(* , ,y / |*,y;O n S n » if n ' = K'l'fy 

0 otherwise 



3.51) 



w(n \n\ t) 



which exclusively describes direct pair interactions. For the second case let us 
choose 



X] ^ab(x',y'\x, y, t)n b y n a x if n' = n a x , a x 

b y y' 

0 



w(n'\n; t) 



otherwise 



(3.52) 
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which means configuration dependent individual transition rates 

w a (x'\x; n;t) ™ab(x’ , v' \x, y; t)n b y (3.53) 

4 y y' 

and, therefore, indirect interactions. 

A comparison of both cases with respect to (3.12) and (3.15) shows that the 
mean value equations of the corresponding master equations (3.50) are comple- 
tely identical. However, a difference appears concerning the covariance equa- 
tions: In the case (3.52) of indirect interactions only the terms (3.24a) of the 
second jump moments contribute to the covariance equations whereas in the 
case (3.51) of direct interactions additionally the terms (3.24b) have to be taken 
into account. Hence we cannot manage without the concept of direct interac- 
tions. 

In summary, by means of the covariance equations we can distinguish various 
stochastic processes which possess identical mean value equations. 



3.6.2 Differences Concerning the Mean Value Equations 



In the last section it became clear that for each model which bases on direct 
interactions there exists a model which bases on indirect interactions and has 
the same mean value equations. Now it will be demonstrated that the opposite 
is not true. For this purpose, let us consider 

w(x \x\n\t) :=i/e 



= 7i [ K ( n *' “ n *)] 

/=o 

OO / / i| 

= (3-54) 

/=0 k= 0 v ' 

which can be transformed into 



w(x'\x; n; t ) = ^(n*)* !?(*) (3.55a) 

*=o 



with 

OO / 

W(fc) •’= ^(-1)* ^2 &!(/ _ ( nx ') 1 k - (3.55b) 

The transition rates w(x'\x] n; t) could be understood as a result of direct 
interactions only if the functions w^) were interpretable as transition rates of 
direct interactions between k elements. Because of k) < 0 for odd k and 
the dependence of w ^ on the occupation numbers n x t this is not possible. 
Therefore, we cannot manage without the concept of indirect interactions. 




4 The Fokker-Planck Equation 

4.1 Introduction 



From the master equation we can derive a Fokker-Planck equation by means 
of a second order Taylor approximation. The Fokker-Planck equation is 
a linear partial differential equation of second order so that, not least 
thanks to the analogy to the Schrodinger equation [250], there exist many 
solution methods for it [27,83,84,86,95,96,241,242,259]. In contrast to the 
master equation the Fokker-Planck equation takes into account only the 
first two jump moments. The mean value and covariance equations, however, 
agree with those of the master equation. 

From the Fokker-Planck equation, by factorizing the pair distribution func- 
tion, we can gain ‘Boltzmann-Fokker-Planck equations’ which are 
also derivable from the BoLTZMANN-like equations by means of a second order 
Taylor approximation. An application of these new equations is discussed in 
Chapter 9. Since the Boltzmann-Fokker-Planck equations depend non- 
linearly on the probability distributions their solution is complicated. However, 
there exists a recursive solution strategy that is related to Hartree’s self- 
consistent field method for quantum mechanical many-particle systems [114] 
(which is applied to the evaluation of atomic and molecular orbitals). 



4.2 Derivation 



The continuous master equation (1.5b) can, by writing X' in the first term as 
X — Y and in the second term as X -f- Y, be transformed into 



4-P(X,t)= [ d M Y\w[Y\X - Y;t]P(X -Y,t) - w[Y\X;t]P(X,t)} (4.1a) 
at J V 



with the notation 

w[Y\X;t\ := w(X + Y\X,t ) 



(4.1b) 



For the right-hand side of (4.1a) a Taylor approximation [270] can be car- 
ried out (cf. Ref. [100], pp. 550ff) which, when applied to the master equation, 
is denoted as the Kramers-Moyal expansion [160,193,84]. Introducing 
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the /th jump moments 



:= j d M Y Y( l - Y^w[Y\X-,t] 



with 



we obtain 

I wo 



l l\ + . . . + Im 



(4.2a) 

(4.2b) 



■ J^ r {[t E 



(-Y 1 )'i...(-Y„)'*' 



7m 

(h+ - -Wm=0 



/i!.../Af! 



dx 

00 (- 1 )' 



-jjfiz ("PI*; <))] - Hy\x-, t]p(x , <)} 



E m 

(4.2c) 



(h+--"WM= : 0 



If series (4.2) is convergent, (4.2) and (4.1) are completely equivalent formu- 
lations. In particular, the transition rates w\Y\X;t\ of the master equation 
can be uniquely reconstructed from the corresponding jump moments (4.2a): 
w[Y\X;t ] is the inverse Fourier transform 



(4.3a) 



of the charakt eristic function 



(4.3b) 



In addition, because of 



0(u>, X ; t) := J d M Y exp(iw • Y)w\Y\X ; t] 



w[Y\X;t] = J d M u exp(-iu> Y)0(w,X;<) 



m h -Im(X, t) = 






i'l+ ' +'Af fab 



d‘ M 



, (4.3c) 



U)i = ...=U)M=0 



the Taylor representation of the characteristic function is determined by the 
jump moments (cf. Ref. [241], pp. 16ff): 



e(w,X;«) 



E E 

(=0 



m h ...lu(X,t) 



Qui)' 1 

hi 



(i u m ) ,m 
ImI 



. (4.3d) 
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In order to analytically and numerically cope with the Kramers- Moyal ex- 
pansion (4.2), it is desirable to approximate this by l terms of the infinite series. 
Since P(X,t) has the meaning of a probability density, our demand is 

P(X,t)> 0 VX,t. (4.4) 

According to the Pawula theorem [216,241] this condition is only fulfilled 
for / < 2. In the second order approximation (/ = 2), (4.2) takes on the form 
of the Fokker-Planck equation [84]: 



where 

and 






dx r 



(4.5) 



mi with // = 1, lj = 0 for J ^ I 



(4.6a) 



mu := 



m h -iM with // = 1 = lj ) Ik = 0 for K / J, J if J ^ I 
m h iM with h = 2, Ik = 0 for K ^ I if J = I. 



(4.6b) 

The configurational Fokker-Planck equation corresponding to the con- 
figurational master equation (3.50) can be obtained analogously. It reads 



-P(rM) = [rn a x (n } t)P(n,t)\ 

a t x b,%' * 



(4.7) 



if n is formally treated as a continuous quantity. Combining the subscripts x 
and superscripts a in a vectorial subscript 

I := (a, aj) tr (4.8) 



and replacing n % = nj by Xi as well as n by X, (4.7) has the same form as 
(4.5) with M = AS. In the following, notation (4.5) will therefore also be used 
for the configurational Fokker-Planck equation (4.7). 



For the second order Taylor approximation to be applicable, the expansion 
parameters Yj have, on average, to be small compared to the system size 



ft := max max IX r — XA . 
/ x' } x 1 1 1 



(4.9) 
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The decisive quantity for this is the smallness parameter S defined by 

2 m n (X,t) 

b := max max ■ — - 
i x W fd M Y'w[Y'\X;t] 

fjMvfYiV u[Y\X-,t] 

= Y [n) < 



(4.10) 



since it is a measure for the relative average jump distance. Transforming 
master equation (4.1) into scaled co-ordinates 



Z := eX with e := — , 

a L 

the corresponding Kramers-Moyal expansion takes on the form 



^z,t) = -£ : 



0i+ •• -M m =0 



/! 



d h d ,M 



with the scaled jump moments 






Therefore, with regard to the order of magnitude, we find 



(4.11) 



(4.12a) 



(4.12b) 






~ S' J d M Y' w\Y'\X\t}. 



Ym\ m w[Y\X;t] 
fi J fd M Y'w[Y'\: 



Y\X;t) f 
>w[Y'\X-,t]J 



d M Y' w\Y'\X]t] 



(4.13) 



According to this, in the case (5 < 1 the scaled jump moments become smaller 
and smaller for increasing values of / = /i + . . . + /m- Consequently, they are 
most often neglected for l > 2. However, we must not neglect the second jump 
moments if we do not want to suppress the effect of the fluctuations. Hence as 
a suitable approximation we obtain the Fokker-Planck equation 



|'’< z .‘) = -Ea|:[w z . ! )c(z.0]+|i: 









4.3. Properties 
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with the scaled first jump moments 



ki(Z,t) := Mi(Z,t) := em/ 




and the scaled second jump moments 



qu(Z,t) := Mu(Z,t) := emu 




(4.14b) 



(4.14c) 



4.3 Properties 



In the following we will briefly treat the properties of Fokker-Planck equa- 
tion (4.14). 



4.3.1 The Continuity Equation 

Equation (4.14) can be written in the form of the continuity equation 



|p(Z,f) = -Vj(5M) 



(4.15a) 



Here 

r\ 

ji(Z, t) := k!(Z, t)P(Z, [q u {Z, t)P(Z, t) 



(4.15b) 



is the component I of the probability current density j. 



4.3.2 Normalization 



In the following we will occasionally require Gauss’ divergence theorem 
which facilitates the transformation of a volume integral over the diver- 
gence Vj(Z,£) of a vector field j(Z,t) into a surface integral (cf. [246], 
pp. 1048ff): 



1 d M ZVj(Z,t) = 


f dO{Z) ■ j(Z,t) 

J 


J 

V 


dV 



(4.16) 
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Here dV means the surface of volume V. dO(Z ) denotes vectors of magnitude 
dO which are perpendicular to the differential surface elements of dV of area 
dO and point to the outside. Thus we integrate over the component 

•'■(*') < 417 > 

of j(Z ,t) that is perpendicular to the surface. 

Gauss’ divergence theorem and continuity equation (4.15) imply 

^ J d M ZP(Z,t) = -f d M ZVj(Z,t) 

V V 

= - J dO j(Z,t) 

dV 

= 0 (4.18) 



if the probability current j vanishes on the surface dV of the system. Hence, 
for the possibility of normalization 



(4.19) 



we must demand the boundary condition 

i(Z ' () U w i0 (4 ' 20) 

In order to guarantee this we will assume that the relations 




d 

dZj 



P(Z,t) 



zeav 



= 0 



(4.21a) 



and 

p M 26S „=° < 42ib > 

are valid, i.e. that the probability distribution P(Z,t) decreases fast enough in 
the neighbourhood of the surface dV and vanishes on it. This is most often the 
case. 



4.3.3 The Liouvilie Representation 

The Fokker-Planck equation can be represented in the form 



^P(Z,t) = C(Z,t)P(Z,t) 



(4.22a) 




4.3. Properties 
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with the Liouville operator 



C(Z,t):~ ^2 ^ k T (Z, t) + ^ ^2 



i,j 



8Z! dZj 



qu(Z,t) 



(4.22b) 



Since, due to (4.3), every Fokker-Planck equation is equivalent to a master 
equation (4.1), many properties of the Fokker-Planck equation directly cor- 
respond to those of the master equation (cf. Sect. 1.3). Therefore, only a short 
review of these will be given in the following. 



4.3.4 Non-Negativity 

If condition P(Z, £o) > 0 is fulfilled for all states Z at an initial time <o, we 
have 

P(Z,t)> 0 (4.23) 

for all later times t > to. 



4.3.5 Eigenvalues 

The eigenvalues A of the Liouville operator have the property that their real 
part is non-positive: 

Re(A) < 0 (4.24) 

(cf. Ref. [241], pp. 104f, 143ff). Since the Fokker-Planck equation possesses 
the eigenvalue A = 0, the existence of a stationary solution P°(Z) is always 
guaranteed. 



4.3.6 Convergence to the Stationary Solution 



For the entropy-like Liapunov function 

K(t) :=Jd M Z P(Z , t) In (4.25) 

(where P{Z ) t) and P ; (Z, t) are two solutions of the Fokker-Planck equation 
(4.14) for different initial distributions) one can show 



K{t) > 0, 

mi < o 

dt ~ 



(4.26a) 

(4.26b) 
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in the case of time-independent jump moments. If P\Z,t) := P°(Z ) is a sta- 
tionary solution and if qu(Z) > 0 and P°(Z) > 0 holds for almost every Z 
(i.e. with probability 1), every solution P(Z ) t) of the Fokker-Planck equa- 
tion will converge to P°(Z) in the course of time and the stationary solution 
P°(Z) is unique. (Cf. Ref. [84], pp. 61ff.) 



4.4 Solution Methods 



4.4.1 Stationary Solution 



The stationary solution P°(Z) of the Fokker-Planck equation in several di- 
mensions can be explicitly found only under certain circumstances. First of all, 
the jump moments ki(Z,t) and qu(Z ) t) must be time-independent. Further- 
more, it will be assumed that in the stationary case not only the divergence 
'Vj(Z) of the probability current j(Z) vanishes but also the probability current 
itself: 

j(Z) = 0. (4.27) 

Then (4.27) can, by means of the product rule (cf. p. 306), be brought into the 
form 






(z) 






dZj 



(4.28) 



If the matrix q(Z) = (qij(Z)) has the inverse £ 1 (Z) = (qjj (Z)^j we obtain 



5 ring'll dP 0 (Z)/dZ! 

W^ nP {z)] ~ P°(Z) 



= £?7k(2) 



K 



- ( k K (z) - £ JL 9 kj (z) 



=■■ MZ). 



(4.29) 



Since the left-hand side of (4.29) is a gradient, (4.29) can only be fulfilled on 
the integrability conditions 



dfi ± dfj 
dZj dZi 



V7, J . 



(4.30) 



However, if these are satisfied, the stationary solution can be written in the 
form 



P°(Z) = P°(Z o)e 



-HZ) 



(4.31a) 




4.4. Solution Methods 
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with the potential 

(4.31b) 

(cf. Ref. [84], pp. 106f). This implies 

/j(z) = "0§7* (z) - (4 ' 32) 

In the one- dimensional case integrability condition (4.30) is automatically ful- 
filled. The Fokker-Planck equation then possesses the stationary solution 



(4.33) 



4.4.2 Path Integral Solution 

For the time- dependent solution of the Fokker-Planck equation a path 
integral representation exists which is related to the quantum mechanical 
path integral solution of the ScHRODlNGER equation [73,75]. This reads 

P(Z, t) = Jim f d M Z n -! ... I d M Z 0 f[P(Zi,ti\ z i-uti-i) P(Z 0 ,t 0 ) 

n_4 °° ^ ^ Li=i 

(4.34a) 

with 

Z n = Z, ti := to + iAt, At := , (4.34b) 

n 

and the conditional probability 

P{Zi+\,U+\\Zi,ti) := -r~== = ■- 

^/(2ir) M €At|£| 

XCXP (~2eK<[^ +1 ~ - Zi - fc(Z,-,<,)j) 

(4.34c) 

(cf. Ref. [241], pp. 73ff). Here q~ l denotes the inverse of q = (qu) and \q\ its 

determinant. 
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4.4.3 Interrelation with the Schrodinger Equation* 

Since quantum mechanics provides many solution methods for the 
Schrodinger equation, it is sometimes useful that the Fokker-Planck 
equation can, on certain conditions, be transformed into a Schrodinger equa- 
tion. 

Let us assume the existence of a stationary solution of the form (4.31) which 
fulfils integrability conditions (4.30) and makes the probability current (4.15b) 
vanish. Then, like in Section 1.4.1, the Fokker-Planck equation 

^P(Z,t) = C(Z)P(Z,t) (4.35) 

(cf. (4.22)) can, by means of the transformation 



Hz,t) 



P(z,t) 

•Jp»m 






(4.36a) 



be converted into the equation 






with the HERMiTian (self-adjoint) operator 



H:=e* l2 Ce 



<H 2 



% can be written in the form 



n =w^‘wr vm 



(4.36b) 



(4.36c) 



(4.36d) 



with the potential 



V(Z) 



J/2 

e 



a d 

dZi tqiJ dZj 




f d<j> d<f> f f d d<s>\ 

4 qiJ dZt dZj 2 V dZi qu dZj ) 



(cf. Refs. [241], pp. 139ff, and [242]). Now, after formally setting 

t i 



r:= ifi = -r 



(4.36e) 



(4.37a) 



This section can be omitted during a first reading. 
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and 




|V>(r)) :-i>(Z,ihT), 


(4.37b) 


(4.36) takes on the form 




ifi^r|V>(r)) =H\ V>(r)) 


(4.37c) 



of a Schrodinger equation [250] with the ‘mass tensor 5 




4.5 Mean Value and Covariance Equations 



First, let us define the mean value of a function in the same way as in (1.168). 
Then, by multiplying the Fokker-Planck equation (4.14) by Zk and ZkZl 
and integrating over Z we obtain 



and 



d(Z L Z K ) 
dt 






= {Z K k L (Z,t)) + { ZLk}((Z,t )) + €(qKL(Z,t)) . 



Here we applied the relations 



dZi 

l^Z K pr7 ar7 QiJ 



l,J 



dZi 8Zj 



= £ 



d_ 

■j'dZj 

d 6 



x>9§:*' = 



(4.39) 



(4.40) 



(4.41a) 






dZj 8Zj 



{ZkQij} 



- 'Ew I ( qKI+qiK )’ 



dZi 



(4.41b) 



C\ C\ 

V ZsZt-^-kj = (ZtcZik/) - ( ZgkL + ZLkjc) , (4.41c) 



'dZj 

Yh^Zt ^r, QIJ 
I,J 



dZ, 
d d 



dz I dZj' liJ 'ZjdZidZj 



( ZkZlQIJ ) + (qKL + iLK^j 



— Y1 i qiK + qKI ) + ( 9/L + qLI ) • 



(4.41d) 
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Apart from this, we used Gauss’ divergence theorem and neglected the surface 
terms because of the assumed vanishing of P(Z,t) on the surface dV . 

Finally, for the covariances 

SklO) ■■= (ZkZl) - (Z K )(Z L ) (4.42) 

we find the equations 

- Jf - = (e<lKL(Z,t)^ + ({Zk - ( Z K ))kL(Z,t )) + Zl - (ZL))k K (Z,t)^ . 

( 4 - 43 ) 

If the mean value equations (4.39) and the covariance equations (4.43) are 
subject to an inverse transformation into the co-ordinates X = Z /e, they read 

(4.44) 

and 

^ = (mu(X t t)) + ((*/ - (X/»m,(X,t)) + ((Xj - (Xj)) mi (X,t)) 

(4.45) 

A comparison with (1.172) and (1.175) shows that these equations completely 
agree with the mean value and covariance equations of master equation (1.5). 

During the evaluation of (4.39) and (4.43) we will, in general, apply the same 
approximations as in Section 1.5 in order to obtain closed equations. By 
means of a method analogous to the projector formalism of Section 1.2.4 
one can also derive exact closed mean value equations from (4.22) which, 
however, like (1.72), contain a time integral (i.e. they are non-local in time: cf. 
Ref. [292], pp. 73ff). 

4.5.1 Interpretation of the Jump Moments 

The interpretation of the first and second jump moments follows directly from 
mean value equations (4.44) and covariance equations (4.45). If X is the state 
of the considered system at the time t this implies 

P(X',t) = Sx’x and (/(*', <)>* = f(X,t) (4.46) 

for an arbitrary function f(X,t). Consequently, the following holds: 

mi(X y t) = (m/(Af',^))t 
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= J&i; (<*»■«• -<*»■) 



and 



= |,„(0 + 0 + 0 - mpi.m'MX'M 



with 



dt 



_ HxW)t , yM d(x'j) t , d{x'j) 

- — Q~ t \ x ih—5i ( x Jh^r~ 



dt 



dt 



- iS x 4(< x »‘«*- x ') 



t+At 



(/(X', t + A *)) = £ /(X', t + A t)P{X\ t + At ) . 



(4.47b) 



(4.48) 



That is, mj determines the systematic change of the mean value ( X \ ) (the 
drift) and mu the change of the covariance au (the diffusion, i.e. the 
broadening of the distribution P(X, t) by fluctuations). Therefore, we call the 
first jump moments mj the drift coefficients and the second jump moments 
mu the diffusion coefficients [241,151,292]. 



4.6 Boltzmann-Fokker-Planck Equations 



In the following we will start with a Fokker- Planck equation for a system 
that consists of N subsystems a and proceed in a similar way as in Chapter 2. 
Again we combine the states x a of the N subsystems a in the vectors 

X :=(*!,... ,x a ,...,x N ) tr , (4.49a) 



®or — (^al) * • • ) %cti > • • • > %otm) £ Tor • (4.49b) 

Then in (4.5) we have I := (a, i) tr and M = m ■ N . 

Now let us define the vectors 

X a := (0, . . . , 0, x a , 0, . . . , 0) tr 



(4.50) 
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and 



j£<x i...a k 



k 



■■= E x "' ■ 



The transition rates w[Y\X;t] can be written in the form 



(4.51) 



w aia2 [Y a ' a *\X a ' a *;t] 



u>[r|X;*]:= { 



a "\X ai - 



ify = Y ai 

if y — Y a i a 2 



;t] ifY =Y ai - a " 



0 



otherwise 



(4.52) 



in which they are decomposed into contributions 

w ai ... ak [Y a '- a *\X a '- ak -t} (4.53) 

that describe the interactions between k subsystems. The restriction 

u)<x 1 ...ot k =0 if two subscripts a,-, aj are identical (4.54) 
guarantees that self-interactions are excluded. 

Furthermore, only the terms k < 2 will be taken into account which is well 
justified if the transitions of the subsystems are essentially due to spontane- 
ous transitions w a and pair interactions w a p. Then, as jump moments of the 
corresponding Fokker-Planck equation 

Of,* 

+ 3 £ £ a^-^J M*- ,)p(x ' *>] ( « 5 *> 

a,i p,j al PJ 

result 

m ai (X,t) := j d M Y y ai w[Y\X-t} 

d M Yy ai wp[Y 0 \X l3 ;t] 

+ \Ysj d M Yy ai w Py [Y^\X^ ; t] 

= J d M Yy ai w a [Y a \X a -,t] 
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+ \ (^2 1 rf M yy oi ^ 7 [y a7 |X^;<] 

N 

+ E / d M Y y ai w 0a [Y pa \X^-,t) 
0 J ) 
= m a ai (X a ,t) + ^2m a J(X a ^t) 

P 

i(X,t) := J d M Y y ai y ?j w[Y\X;t\ 

= E / d M Yy ai y Pj w 1 [Y'i\X' 1 ] t} 

P J 

+ &J d M Y y am w lS [Y^\X^;t] 



= 6 afi rn^iaj {X a ,t) + E m alaj (X ay , t) 

7 

+ (4.55c) 

Here the following notations were introduced: 

m T oj (r,0 := Jd M Yy ai w^\X^,t], (4.56a) 

ml\(X^,t) := j d M Yy ai w 7S [Y^\X^-,t\, (4.56b) 

ml m (X\t) := j d M Yy ai y 0j w y [Y^\X^,t], (4.56c) 

ml S i Pj (X yi ,t) := j d M Yy ai y^w y6 [Y^\X^;t). (4.56d) 

Now we integrate Fokker-Planck equation (4.55) in an analogous way as 
(2.12) over all variables xp with /? ^ a and again assume the factorization 

Pap(*onVp\t) := P a (x a ,t)Pp(xp,t) (4.57) 

of the pair distribution function P a p(x ay xp]t). Then we obtain, apart from 

J (Tx a P a (x a ,t) = 1, (4.58) 

the equations 

Q rt 

— P a {x ai t) = ^ ^ " Kai{&on t)Pa(&on Oj 
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4" 2 Qx ■ dx [^“*'1 ( a ' a ’ ^)d 3 a {x a , o] • (4.59a) 



The effective drift coefficients 



K a i(x a ,t) := m^ i (x a ,t) + '^2 j d m xpm^(x a ,xp,t)P 0 (x 0 ,t) (4.59b) 



and the effective diffusion coefficients 



Qcij(x a ,t) := m% iaj (x a ,t) + ^ j d m xprn“ l ? 0 , j (x a ,Xi3,t)Pp(xi3,t) (4.59c) 



make use of the conventions 



m a (x ai t) = m a (X a 1 t ), 
m a0 (x oi ,xp,t) = m a P(X a P , t) 



(4.60a) 

(4.60b) 



and take into account, each with its last term, the interactions between the 
subsystems /?. Moreover, we applied Gauss’ divergence theorem in order to 
convert the terms 




dx 0 



m ai pj(X,t)P(X,t) 



(4.61) 



for /? / a into vanishing surface integrals (cf. Sect. 4.3.2). Finally, we used the 
relation 

m 7 6 = 0 for 7 = 8 (4.62) 

which follows from (4.54). 

Distinguishing only A different types a of elements consisting of N a subsystems 
(cf. Sect. 2.3) implies 



and 



m 



a(3 



7a = 7a if £ a , 

Pa = Pa if € a , 

J d m xP a (x,t) = 1 , 

m a = fh a if a E a , 

( rh ai if a / /?, a E a, (3 E b 
0 if a = /?. 



(4.63a) 

(4.63b) 

(4.63c) 

(4.63d) 



(4.63e) 
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So we obtain (with respect to the considerably reduced number of variables) 
equations which are significantly simplified: 



^•Pa(*)0 — £)P a (a;,£)j 

+ 2 ^d^d^[ Qaij ^ x,t ^ x,t>> \ 

ij J 



with the effective drift coefficients 



K ai (x,t) := m“,(x,<) + ^ f d m ym a a b i (x,y,t)P i {y,t) 

b J 

= J d m x' Ax' i w a (x'\x,t) , 



(4.64a) 



(4.64b) 



the effective diffusion coefficients 




(4.64c) 



and 



(4.65a) 



m 



ab 



Ni • fh ab if 6 ^ a 

(N a — 1) • fh ab if b = a . 



(4.65b) 



The result (4.64) could also (and even very easily) have been obtained from 
the BoLTZMANN-like equations (2.22) by means of a second order Taylor 
approximation (cf. Sect. 9.2 and Ref. [184]). Thus it is suggested to denote 
equations (4.64) as the ‘Boltzmann-Fokker-Planck equations’. 



4.6.1 Self-Consistent Solution 

Due to factorization assumption (4.57) equations (4.64) are closed equations, 
i.e. they contain no functions (like, for instance, pair distribution functions 
Pap(x a ,xp;t)) which cannot be determined by the others. However, since they 
are non-linear in P a (x,t) due to the dependence of the effective drift and 
diffusion coefficients on P&(y , t ), we can only apply an approximation method to 
determine their solution. Having this in mind, we start from suitable probability 
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distributions Pa°\x,t) as e.g. the solutions of the Fokker-Planck equations 
which neglect pair interactions. These are given by 

K ai (x, t) := t ) (4.66a) 

and 

Qaij(x,t) := m a aia j{x,t ) . (4.66b) 

With Pa°\x,t) we evaluate the effective drift and diffusion coefficients for the 
next iteration step which gives the next approximation Pa^ix^t) etc. Hence 
our iteration method has the form 

(4.67a) 



with 

(4.67b) 
(4.67c) 

and 

(4.67d) 

If the sequences of the functions P^ k \x,t) converge for k — > oo, we have 

(4.68) 

The recursive method (4.67) is especially useful for the determination of sta- 
tionary solutions Pa(x). It formally corresponds to the quantum mechanical 
self-consistent field method of Hartree (cf. Refs. [49, 114]). An analogous 
recursive solution method can also be formulated for BoLTZMANN-like equati- 
ons (2.22). 

In the one-dimensional case (m = 1), the recursion formula for stationary 
solutions (x) of the Boltzmann-Fokker- Planck equations is particularly 
simple. Here the stationarity condition 

° = [^‘ ) (*)^. 0( ‘ +1) (*)] ( 4 - 69 ) 
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can, due to (4.33), be cast into the form 



Qa (#) 



exp 



X 



^V)\ 

Qa k \x') J 



with 

J dx P^ k+1 \x) = 1 . 



(4.70) 

(4.71) 




5 Langevin Equations and 
Non-Linear Dynamics 



5.1 Introduction 



For the modelling of stochastically behaving systems, apart from the master 
equation and the Fokker-Planck equation, often also stochastic differen- 
tial equations are used (cf. Ref. [84]). The most common one is the (general) 
Langevin equation 




(5.1) 



where 

0(Z,t):=J2Gu( z J)O(t) (5.2) 

J 

describes the contribution to the change of the state Z which is caused by the 

fluctuations 

m-= (6 (<).•• -,6f(<)) tr - ( 5 . 3 ) 

The Langevin equation is particularly suitable for the discussion of the sta- 
bility of a system state against fluctuations. If the system state is stable, the 
fluctuations will become clearly noticeable only after a long time period (the 
escape time) — namely, when the fluctuations accidentally add up. However, 
the fluctuations play an essential role at critical points (points of instabi- 
lity) at which the system is subject to phase transitions (i.e. discontinuous 
changes of its properties). In this context, the different routes to chaos are 
particularly interesting. 

If the fluctuations £(£) are ^-correlated and GAUSSian distributed (cf. (5.23)), 
F/(Z,£) describes just the most probable state change dZ(t)/dt of an in- 
dividual system. In addition, the description of the considered system by Lan- 
gevin equation (5.1) is equivalent to a delineation by the Fokker-Planck 
equation 






d d 
dZt dZj 



[qjj(Z,t)P(Z,t)\ 

(5.4) 
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with the drift coefficients 



(5. 



and the diffusion coefficients 



(5.5b) 



Whereas the Langevin equation reflects the fluctuation affected state changes 
of single systems, the Fokker-Planck equation describes the corresponding 
time-dependent state distribution of an ensemble of equivalent systems. 

According to (5.5a), the fluctuations yield a contribution to the drift via the 
term 

^ . (5.6) 

However, this contribution is usually negligible since Gu(Z,t) vanishes for 
e — ► 0 according to (5.5b). This neglection will be not allowed only if the 
relative average jump distance 6 (cf. (4.10)) is large. Omitting the term (5.6), 
the equations 

(5.7) 

for the most probable behaviour Z(t) of an individual system have , in the coor- 
dinates X := Zje, the form 



(5.8) 

These fully agree with the approximate mean value equations (1.179). The equa- 
tions for the most probable individual behaviour are especially suitable for the 
description of single or rare events. For the description of frequently recurring 
phenomena or of ensembles of systems, in contrast, one better applies the 
corrected mean value equations. 



dXi - 

ST = m,(XJ) 



^ = F,(Z,t ) ‘S' k,(Z,t) 



cqu(Z,t) := Y: Gik(Z , 1)Gjk{Z , t) 

K 




If we choose the matrices 

G(Z,t)={Gu(Z,t)) (5.9) 

in a symmetrical and positive definite form (cf. p. 292), by (5.5) we can ea- 
sily determine the Langevin equation which is equivalent to Fokker-Planck 
equation (5.4). For this purpose, we must only find G = G(Z, t) with 

G G - (Y, GikGkj ^ ^ GikGjk^ = e (?/^) = 6 £- (5.10) 




5.2. Derivation 
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Since the matrix cq = eq(Z , t) is, as a consequence of (4.14c), also symmetrical 
and positive definite, it can be represented in the form 

eq(Z,t) = Sr 1 (Z,t)D{Z,t)S(Z,t) (5.11a) 

with a diagonal matrix 

D(Z, t ) = (Dij{Z, 0) = (A j(Z, t) Su ) (5.11b) 



and a rotation matrix 5(Z,<). The matrix 



G(Z,t) := Sr\Z,t)D 1/2 {Z,t)S{Z,t) 



with 

D l /\Z,t)= (dJ' s (Z,o) := (\/A i(Z,t)6u) 

fulfills the demanded relation (5.10). 



(5.12a) 

(5.12b) 



5.2 Derivation* 



In the following we will supply the proof (first given by Stratonovich [267]) 
that Fokker-Planck equation (5.4), (5.5) is, on certain assumptions con- 
cerning the fluctuations £, equivalent to Langevin equation (5.1). This is a 
special case of the general problem to find for a given stochastic differential 
equation 



^ = „(m«.M) 



(5.13) 



the equivalent Kramers-Moyal expansion 



£ 



1=1 

(/i + ...+/m =0 
d‘i QlM 

X 



n 






M h ..., M (Z)P(Z,t) (5.14) 



8Z\' '"dZ'fi 
(cf. (4.12)). For the Langevin equation we have 

tf/(z(f),£(0) :=F I {z(t))+Y^G I j(z(t))o(t) 



(5.15) 



*This section can be omitted during a first reading. 
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if we, for the time being, assume that the coefficients Fj and Gjj are not 
explicitly time- dependent (Fj(Z,t) = Fj(Z ), Gjj(Z,t) = Gjj(Z)). Obviously, 
the problem is solved if the jump moments 



= ife, € -Af(( Zi{t + At) ■ • • • ' ( Zm(< + A<) - Zm ^) ) t+At 

(5.16) 

(cf. (4.47)) have been determined in dependence on the functions Fi(Z) and 
Gij(Z ). Here Z(t) denotes the system state at time t. 



First of all, for the determination of the jump moments we must evaluate the 
quantities 

' ' (5-17) 



A.Z/(At) i — Zi(t -j- At) — Zi(t) 



These are obtained in the limit 



AZ/(A*) = lim AZ{ k) (At) 

k— too 



(5.18) 



of an iteration method which, due to 




(5.19) 



is given by 



and 



t+A* 



AZ ( I 1 \At)= J dt' tf/(z(f ),£(*')) (5.20a) 



oo 

a 4 * +i) (ao = [ d? 52 52 

t '=o 



AZ^it'-tM ..... AZ%(t'-t) 



7^)( 



Im 



(L-f •• -Hm =0 

d h d l 






(5 - m) 

Here the quantity 

Hi m) = Hi (z{t) + A Z(f - 1), *(*')) (5.21) 



was represented as a Taylor series. 



5.2. Derivation 
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Those terms which are of order 0((A£) 2 ) do not contribute to the jump mo- 
ments (5.16). In spite of this, the relations we are interested in will be very 
complicated unless we make special assumptions about the fluctuations £(f). 
These are characterized by distributions 

^n(£ n > J • • • J £i j . . . dt n (5.22) 



which describe the probabilities with which the fluctuations . . . , £ n occur 
during the n differential time intervals + dt\] } . . ., [t n ,t n + dt n ]. In the 
following we assume 5-correlated GAUSsian random processes which are 
given by 




with the normalization constants Af n and 



xu(tut2) := 07 j < 5 (*i -h) 



Without loss of generality we can presuppose 



<?ij = &IJ 



(5.23a) 



(5.23b) 



(5.23c) 



which can always be achieved by means of a suitable variable transformation. 



The relation (5.23b) somehow corresponds to the Markov assumption of 
negligible memory effects since it implies that there is no correlation between 
fluctuations at different times. From (5.23) one can derive the relations 



(th(ti) ■ ■ ■ • £/ 2 *_i(<2fc-i)) = 0 



(5.24a) 



and 



■ ■■■■ Zl2k(t2k)j — ^ - t'2) ■ ■■•■ — tu) 

all 

pairs 



where the sum extends over all (2k)\/(2 k k\) different pairs that can be built 
with the elements of the set {(/ 1 , ti), . . . , (hkihk)} (cf. Ref. [241], pp. 23f). In 
particular, we have 



(m) = 0 



(5.25a) 



(£/i(<i)f/a(*2)) = k/i/ 2 (*i.*2) = -*2). 



and 



(5.25b) 




108 



5. Langevin Equations and Non-Linear Dynamics 



Now (5.20) allows, in connection with (5.23) and (5.24), the specification of 
the quantities AZi(At) as well as of the jump moments M\ x , ,./ M . A detailed 
analysis shows that the relevant terms which are proportional to At can already 
be obtained by means of the second order approximations AZj 2 \At). We find 

+ E [J dt" ^H I [z{t),i{t'))^j 

^ V ' 

= A 

+ 0((Atf) 

a ' 1 it t' 

dt'J dt“\Fj(z(t)) + £G,j r (z(o)fr(t")] 

t K 

+ 0((A<) 2 ) 



W(Z«>) 9Q, t (z(o) 

A?- y v a r _ ) 



= At- 



Fj{z(t)) 
+0((Af) 2 ), 



+ 



At 



dZj 

dG IL (z(t)) 



2 ^ dZj 

J,K,L J 



Gjk &KL 



(5.26a) 



(AZi{At)AZj{At)} = ( 1 / dt’Hj(z(t), £(?))] 

t+At t 

x [ J d*"tf,(z(0,£(O)])+o((A0 2 ) 

a ’' 1 \t t+At p 

df J dt" F,(z(t)) 



+ £G /jr (z(*))fr(0| | Fj(z{t)) 




5.3. Escape Time 



109 



+ '£G JL (z(t))t L (t") J + o((A<) 2 ) 

= A t G *k ( z (*)) Gjl (■ Z(t )) S KL + O ((At) 2 ) , 

K,L 

(5.26b) 

and 

(AZ h (At) ■ . . . ■ AZ/,(AO) = o((A0 2 ) for />3. (5.26c) 

This implies 

M/(Z(0) = Fi (Z(0) + — g/if (s(*)) , (5.27a) 

J)K 

*Mu(z{t )) = J G «W G «( Z W)' (5.27b) 

K 

c 1-1 (z(o) = 0 for / = / x + ... + /* >3. (5.27c) 

Relations (5.27) obviously agree with relations (5.5) (cf. (4.14b,c)). That is, 
if the fluctuations satisfy relations (5.23), Langevin equation (5.1) is equi- 
valent to Fokker-Planck equation (5.4) with the special drift and diffusion 
coefficients given by (5.5). 

For explicitly time-dependent coefficients, Fj(Z ) and Gij(Z ) have, in formulas 
(5.15), (5.26), and (5.27), only to be replaced by Fj(Z 1 t) and Gjj(Z y t) since 
the Taylor expansion of 

Hi(z(t'), £(?),?) = Hj^Z^ + AZi? + -t)) (5.28) 

in A t = (t' — t) exclusively leads to additional terms which, after carrying out 
the integrations (5.19), are of order 0((At) 2 ). 



5.3 Escape Time 

Often we would like to know how long it takes on average until a system leaves a 
stable state Z o due to an accidental summation of fluctuations. For illustrative 
reasons, let us assume a one- dimensional system which obeys the Langevin 
equation 



^ = F(Z) + Gm 



(5.29) 
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and thus can also be described by the Fokker-Planck equation 

= +f (MO) 

with k(Z) = F(Z) and the constant diffusion coefficient q = G 2 . (The con- 
stancy of the diffusion coefficients can always be reached by means of an ap- 
propriate transformation of the co-ordinates; cf. Ref. [241], pp. 88ff. Methods 
for the evaluation of the escape time in multidimensional cases are treated in 
[84], pp. 357ff.) 



Figure 5.1: For the de- 
picted double well poten- 
tial the states Zo and Z<i 
are stable. Despite this, 
after a long time period 
(the escape time) fluc- 
tuations lead to a passing 
of the potential barrier at 
Zi. 



HZ) 




<f>{Z) := J dZ' k(Z') (5.31) 

Zo 

be a double well potential with minima at Zq and Z 2 and a maximum at 
Z\ (cf. Fig. 5.1). Then the Arrhenius formula 



Air 



*9)J&HZo)\&HZi)\ 



2[«Z,)-*(Zo)]/(«») 



(5.32) 



can be derived for the average escape time T that a system needs to get over 
the potential barrier at Z\ starting from Zo for the first time (cf. Ref. [84], 
pp. 139fF). According to this formula, the escape time (which is also called 
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‘first-passage time’) increases more than exponentially with the system size 
Q, = 1/e. For systems which consist of N 1 elements the stability of state Zq 
is, due to Q, = 1/e = N, hardly endangered by fluctuations. 

Whereas the transition from Zq to Z\ is caused by fluctuations, the transition 
from Z\ to Z2 is dominated by the drift. Hence, for arriving in Z 2 from Zi, a 
small time period is necessary in comparison with T. 



5.4 Phase Transitions, Liapunov Exponents, 
and Critical Phenomena 

In order to explain phase transitions we best start from Langevin equation 
(5.1) but let F = (. . . , F/, . . .) tr and possibly also G = ^G/j^ depend on a 
control parameter k: 



— = F(Z,K) + G(Z,K)t(t). (5.33) 

k is given by external conditions and can sometimes be arbitrarily regulated. 
Disregarding fluctuations £(t) for the time being, (5.33) has the form 

^*F(Z,k). (5.34) 

After a finite transient time to the system will in general take on a stable 
temporal behaviour Zo(/c,£)- That is, for t > to the solution Zo(/M) with 

— Z 0 (M) = F^Zo(M),k;J (5.35) 

will practically be located on or move on an attractor (e.g. a fixed point, a 
limit cycle, or a torus). Considering a system which, at the time to , is in a 
neighbouring state Z(/c,<o) ~ Zo(/c,£o), the deviations 

<5Z(k, t) := Z(/c, t) — Zq(/c, t) (5.36) 



obey the equation 



d 6Z dZ dZ 0 

dt dt dt 

= F(Z,k)-F(Z 0 ,k) 

= L(k, t)8Z + 0 (||^Z|| 2 ) 



(5.37a) 
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where 

(5.37b) 



is the JACOBian matrix of the system. Since \\8Z(k, £ o )|| — ► 0 implies 
||6Z(/c, t) || — * 0, the solution of (5.37) for this limit can immediately be written 
as 

6Z(k , t) = M(t % to) 8 Z(k , to) (5.38a) 

with 

M(t s to) = T exp t , (5.38b) 

(cf. p. 299). Hence the deviations 5Z change exponentially in time. 

Now let ej be the normalized vectors of a basis of the state space. We are 
interested in the temporal course of the initial deviations 



L(K,t)= (Ljj(K,t)^ ^^-Fi^Z 0 (K,t),Kj) 



6Z r (K,t 0 ) := e/(Mo)ej 
in directions e/. Let us denote their magnitude by 



(5.39) 



(i(k, t) := ||<5Z/(k, t)|| = ||M(t, t 0 ) 8 Zi{k, f 0 )|| • (5.40) 



A measure for the mean exponential change of the magnitude e/(/c,£) of the 
deviations 8Zj(n,t) during the period (t — to) of time is 



A/(M) 



t-t 0 C/(Mo) 



due to 



f/(M) = e 






f/(Mo) 



(5.41) 



(5.42) 



Thus a criterion for the stability of solution Zo(/c,£) with respect to small 
deviations <5Z(/c,*o) are the Liapunov exponents [264,260] 



A j(k) lim lim 
t-^oo ej(/c,to)- 



A/(/c,£)= lim lim In 

►0 t-*oo e/(/c,t o )--+>0 — to 



ei(n,t 0 )) 



(5.43) 

It can be shown that the Liapunov exponents {A/(/c)} are independent of the 
choice of the basis {e/} [210]. 



• If all Liapunov exponents Aj(/c) are negative, small deviations will be 
exponentially damped. Then fluctuations do not disturb the stability of 
the solution Z 0 (/c, t) as long as they do not accidentally sum up (cf. Sect. 
5.3). 
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• If at least one Liapunov exponent A/(/c) is positive, a small initial de- 
viation <5Zj(/c,<o) will grow exponentially until it usually reaches a limit 
due to the non-linear terms 0(||£Z|| 2 ). Then a new solution Zi(/c,£) re- 
sults. (A detailed discussion is given in Reference [106] where it is also 
investigated which new patterns of motion result on which conditions.) 



With respect to a variation of the control parameter /c, those values /c,- for which 
a Liapunov exponent reaches zero and the solution Z,(/c,f) becomes unstable 
will be denoted as critical points. The phenomenon of a discontinuous 
change from one stable solution Zi(/c t “ , t) to another stable solution Z i+1 (K,f ,<) 
during the variation of the control parameter k from kJ /c,* — 6 k to := 
Ki + 8k is called a ‘phase transition 9 or, in chaos theory, a ‘bifurcation’. 
Phase transitions can only occur in non-linear systems and are very typical 
for those. In particular, they are denoted as noise induced transitions if 
caused by the fluctuations 5 contributions 






J,K 



dGu(Z,K) 

dZ K 



Gjk(Z,k) 



(5.44) 



to the drift coefficients kj(Z) (cf. Ref. [145]). 

A phase transition is preceded by critical phenomena: The deviation 
«Z/(Mo) that belongs to a Liapunov exponent A/(/c) which is slightly less 
than zero will be very slowly damped. Hence in this connection one speaks of 
critical slowing down. Additionally, owing to the small damping, fluctuati- 
ons in direction <5Z/(/c,f) become strongly noticeable, i.e. the dynamics of the 
system is influenced by the fluctuations £(<). These are denoted as critical 
fluctuations. 



5.5 Routes to Chaos 



Among the phase transitions (or bifurcations) the ‘routes to chaos’ are of spe- 
cial interest. There are several different routes to chaos (‘chaotic scenarios’) 
of which 

• the Ruelle-Takens-Newhouse scenario [197,254] via Hopf bi- 
furcations [144,254], 

• the period doubling scenario via pitchfork bifurcations [70,71, 16, 
254], 

• and the intermittency route to chaos via a tangent bifurcation [223, 
146,16,254] 
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are the most famous ones (cf. also [183, 135]). 

The common property of all chaotic phenomena is that the slightest change of 
the initial state of the system can, after a certain time, lead to a completely 
different dynamics. Thus, within the frame of all possible measuring accu- 
racy, the dynamic behaviour of a chaotic system is not any more predictable 
after this time (‘butterfly effect’). 

It already became clear in Section 5.4 that the Liapunov exponents are the 
decisive criterion whether the system behaviour is chaotic or not. However, a 
number of other methods also allow an examination for chaos (cf. Refs. [135, 
254,264]): 



• The measurement of the irregularity of the temporal course. For exam- 
ple, in case of intermittency, nearly periodic phases (so-called inter- 
missions) are randomly interrupted by relative short irregular bursts (cf. 
Fig. 8.24). 

• The analysis of phase portraits, i.e. of projections of the trajectory 
(which is the path of temporal evolution). If a phase portrait displays 
a closed curve, we will have periodic temporal behaviour (cf. Figs. 8.8, 
8.10, and 8.18). Chaotic as well as quasi-periodic behaviour is cha- 
racterized by trajectories which are not closed and do not end in a fixed 
point. In case of quasi-periodicity the curve closes only almost and then 
runs ‘parallel’ to one of its former curve segments (cf. Fig. 8.12b). Other- 
wise we have chaotic behaviour (cf. Fig. 8.21). 

• The investigation of Poincare sections (Poincare maps), which, 
instead of projections of the trajectory, depict its intersection points 
with a certain plane through the state space (cf. Refs. [222, 254]). Chaotic 
behaviour manifests itself by a cluster of infinitely many points whereas 
periodic behaviour is indicated by a finite number of points. 

• The evaluation of the power spectrum (cf. Sect. 5.5.2). For chaotic 
behaviour the maxima of the spectrum are dense (cf. Fig. 8.21). 

• The calculation of the Liapunov dimension Dry = Dky(k) which 
Kaplan and Yorke conjectured to be a measure for the dimension 
of the attractor. If A((/c) denote the Liapunov exponents A j(k) in a 
decreasing order (i.e. A^ > . . . > A( > . . . > A^) and if n is the largest 
integer number with 

£>;<«)> o, 

«=i 



(5.45) 




5.5. Routes to Chaos 



115 



the Liapunov dimension is defined by [152,135,254] 



Dky(k) 



A a;.(*) 

” + Uk + i(«)I 



(5.46) 



For all cases of a fractional Liapunov dimension we have chaotic be- 
haviour. The same applies to the Hausdorff dimension [99, 118, 180, 
254] and the correlation dimension (cf. Ref. [254], pp. 127If) which are 
more reliable and more usual measures for the dimension of an attractor. 

• The consideration of the autocorrelation function 



to+T 

C(At):= lim ^ f dt y(t)y(t + At) 

T—*oo 1 J 
to 



(5.47) 



which measures the average correlation of a time signal y(t) with itself 
at a later time t + At, For chaotic behaviour C(A£) generally decreases 
exponentially with growing At since the temporal evolution of y(t + A t) 
will become more and more unpredictable on the basis of y(t) (cf. Ref. 
[254]). 

In the following, Liapunov exponents and power spectra will be discussed in 
more detail. 



5.5.1 Ruelle-Takens-Newhouse Scenario and Liapunov 
Exponents 

The definition of the Liapunov exponents Aj(ac) was already given in Section 
5.4. They are a measure for the average temporal exponential change of devia- 
tions into various directions in the course of time (cf. (5.43)). If the sum of the 
Liapunov exponents is negative, a small volume AV^ will shrink to zero: 



(5.48) 



Then neighbouring trajectories converge to one and the same attractor and 
the system is called a dissipative system. 

Above all, we distinguish the following important cases: 

(1) If all Liapunov exponents are negative, there is Dky = 0 and the tra- 
jectory of the system moves towards a stable fixed point. 



lim AVYt) = 0 

t — ► oo v 7 
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(2) If one of the Liapunov exponents is zero and the other ones are negative, 
we have Dky = 1 and the trajectory approaches a limit cycle (cf. Figs. 
8.13b and 8.18). A deviation SZi(k , Jo) tangential to the limit cycle stays 
undeformed f then, whereas deviations 5Zj(/c,Jo) in all other directions 
are compressed. 

(3) If two Liapunov exponents are zero and the others are negative, there is 
Dry = 2 and the trajectory converges to a path which runs on a torus 
(cf. Fig. 8.10). Deviations in one of the two rotation directions remain 
undeformed whereas deviations in all other directions are compressed. 

(4) If at least one Liapunov exponent is positive, one is zero, and the others 
are negative, we have a strange attractor (cf. Fig. 8.21). In directions 
which correspond to positive Liapunov exponents a stretching of the 
deviations occurs, i.e. neighbouring trajectories move off from each other. 
A compression , i.e. an approach of neighbouring trajectories, however, 
takes place in those directions which correspond to negative Liapunov 
exponents. The alternation between compression and stretching processes 
leads to a folding which explains the sensitive dependence on the initial 
conditions and causes a tattering of the volume element AV (that is 
responsible for the strange shape of the attractor). 

For the Liapunov dimension of a strange attractor we find Dky > 2. In 
addition, Dky is generally no integer but a fractional number. Hence we 
speak of a fractal dimension [180, 17, 183]. 



We are confronted with a Ruelle-Takens-Newhouse scenario if a system 
goes in succession through the situations (1) to (4) along with a variation of 
the control parameter k. 



5.5.2 Period Doubling Scenario and Power Spectra 

The Fourier spectrum of a function y(to + 1) is defined by the function 



+oo 

/(w) := J dt e wt y(t 0 + f) 
-00 



(5.49) 



Roughly speaking, the Fourier transformation (5.49) decomposes a time- 
dependent function y(t 0 + J) into sinusoidal contributions of different frequen- 
cies u and amplitudes f(u). (If evaluating the Fourier spectrum of a time- 
dependent function which starts at time Jo by means of a computer, we inte- 
grate only from Jq up to a very large value of T. The mistake connected with 
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this appears predominantly for large values of u which are not of particular 
interest.) 



Often, however, the application of the logarithm 

log f(u) 



(5.50) 



of the power spectrum 



f(u) := /(w) 



(5.51) 



is preferred to the Fourier spectrum. One interesting property of /( u) can be 
discovered by means of the convolution theorem (cf. p. 295): 



+ 00 

|/(w)| = J a 



dte wi lim 

T-+oc 



to+T 

i J dt' y(t')y(t' + t) . 



(5.52) 



This relation agrees with the Wiener-Khinchin theorem [296, 158] 




(5.53) 



according to which the power spectrum is the Fourier transform of the au- 
tocorrelation function C(t) (cf. Refs. [84], pp. 15ff, and [264], pp. 113f). 

When considering the power spectrum /(u>) of a function y(t) = y(t + To) with 
period To, we find especially large amplitudes at the frequencies 



27T 

u;(n) := n • u>o with u> 0 := — (n G {0,1,2,...}). (5.54) 

io 



Thus a function with period Ti := i To has its spectral maxima at the frequen- 
cies 



with u>i := — (n € {0,1,2,...}). (5.55) 

-Li 

If, along with a change of the control parameter a system is subject to a 
period doubling (i.e. 7*+i = 7j|+i(/c,_|.i) = 2 • = 2 • 7<(/c,*)), the power 

spectrum displays maxima at the frequencies 

w<+i(n) = |wi (n€ {0,1,2,...}). (5.56) 

That is, additional maxima occur in the middle of those maxima which already 
existed before the period doubling. This will become especially clear in Figures 



uj(n) = nuji — — c^o 

i 
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8.18 to 8.21. In the limit of infinitely many period doublings we find a transition 
to an aperiodic or chaotic behaviour. Then the maxima of the power spectrum 
are dense, i.e. the power spectrum possesses an irregular appearance (cf. Fig. 
8 . 21 ). 

The period doubling route to chaos described above often occurs in systems 
which are periodically forced with a frequency u>q. 




Part II 



Quantitative Models of Social 
Processes 




Overview 



In Part II of this book the stochastic methods from Part I will be made applica- 
ble to the modelling of social processes. It will turn out that the differences with 
respect to physical models do not primarily concern methodology but rather 
terminology and system specification. This is the topic of Chapter 6. 

Since the characteristic properties of a system are reflected by the transition 
rates, we will discuss in Chapter 7 how to choose these for social processes 
in the most appropriate way. An approach which bases on the concept of uti- 
lity maximization is found to be particularly plausible. It takes into account 
the effort of behavioural changes and the incompleteness of information 
during a decision. 

Regarding the interaction rates, it is possible to classify three different kinds 
of direct pair interactions — imitative processes, avoidance processes, 
and compromising processes. The resulting BoLTZMANN-like model for 
behavioural changes of individuals includes, as special cases, a number of 
equations which have important applications in the social sciences. Worth men- 
tioning are the logistic equation for limited growth processes, the gravity 
model for migration phenomena, and the game theoretical approaches for 
the description of the cooperation or competition among individuals. 

In Chapter 8 the model for behavioural changes will be specified with an ex- 
ample concerning opinion formation. It shows that, apart from other cases, 
oscillatory or chaotic opinion changes are possible. In addition, we will discuss 
how spatial opinion spreading by diffusion or telecommunication can be 
taken into account. 

A reformulation of the model from Chapter 7 by means of Boltzmann- 
Fokker-Planck equations and Langevin equations respectively will be car- 
ried out in Chapter 9. This allows a mathematical specification of Lewin’s idea 
that the behaviour of individuals is guided by a ‘social field’. 

Chapter 10 is dedicated to the game dynamical equations which result for 
a special case of imitative processes. Thus immediately a stochastic formu- 
lation of game theory becomes possible. With an example for the competition 
of equivalent strategies we will explain the self-organization of behavioural 
conventions. 

The topic of Chapter 11 is the procedure of determining the parameters of the 
developed models from empirical data. It will show up that even in the case 
of incomplete data the quantities which are of interest can be estimated. As 
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a concrete example, the migration data of removals between the federal states 
of West Germany will be examined. In this connection also the procedure of 
successive model reductions will be illustrated. 

Of particular interest for model applications is the decomposition of the time- 
dependent utility functions with regard to explanatory variables. As examp- 
les we will discuss approaches for the behaviour of purchasers and voters. The 
evaluation of empirical data can afterwards be continued by means of progno- 
sis techniques. 




6 Problems and Terminology 



6.1 Terms 

This section explains the meaning of the terms that were introduced in Part I 
for social processes. 

6.1.1 System and Subsystems 

In connection with the social sciences we will use the term ‘system’ to re- 
fer to a social system. Its constituent elements (units, subsystems) are 
individuals which undergo social interactions with each other. 

6.1.2 State 

We will assume that individuals can be characterized by their properties . 1 
Let the state x combine those properties of an individual which are import- 
ant for the description of his/her behaviour in a given situation and which 
are subject to noticeable temporal changes. For example, in connection with 
opinion formation concerning a certain affair, x can combine the opinion i and 
location r of an individual: 

*:=(z>) tr . (6.1) 

6.1.3 Subpopulation 

Those properties of individuals which remain practically constant in time du- 
ring the considered behavioural changes but which influence these, are a conve- 
nient criterion for the subdivision of the examined total population into several, 
internally homogeneous subpopulations o. If the total population consists of 
N individuals and subpopulation a of N a individuals, we obviously have 




1 In this context, with properties we do not only mean the character traits but all quan- 
tities which are required for a complete description of an individual a. 
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Per definition, with regard to an investigated affair, individuals of the same 
subpopulation have a similar behaviour pattern (‘character’) whereas individu- 
als of different subpopulations differ in their behaviour pattern. A behaviour 
pattern is characterized by the individual transition rates w a , w a b • Thus, 
for individuals of one and the same subpopulation, we do not assume similar 
behaviours but only similar probabilities for equal behaviours. 

Normally, by subpopulations a we distinguish different types of behaviour a, 
but in some cases also the residence location r or other quantities can be de- 
cisive for the subdivision into subpopulations. The behavioural types a can 
mostly be characterized by a few personal circumstances or socio-economic 
criteria (e.g. age, religious affiliation, education, and level of income) the re- 
levance of which depends on the considered affair. This is the case since indi- 
viduals show a tendency to increase the consistency between their attitudes, 
behaviours, and social environment (cf. Fig. 6.1 and [119,120,37,213,74]) as 
a consequence of their attempts to maximize their utility U a {x^t) by the re- 
spective behaviour x. U a (x,t) consists of two parts: 

u a (x } t ) = B a (x ) t) + S(a5,f) (6.3) 

(cf. Ref. [77], p. 16ff). £ a (a;,f) stands for the attitude towards behaviour x 
or for the satisfaction which an individual of subpopulation a expects from 
behaviour *. S(x,t ) is the ‘social reinforcement’, i.e. the ‘reward’ or the 
‘punishment’ which an individual must expect as a consequence of behaviour x. 
Thus an individual has three different possibilities to maximize his/her utility: 

• The individual can decide on a behaviour x' with B a (x , ) t) > B a {x^t) 
instead of on behaviour x. 

• If, due to social pressure —S(x,t), an individual shows a behaviour x 7 
which does not agree with his/her attitudes, this will change the valuation 
B a (x f , t ) in the sense of a revaluation of behaviour x f . This phenomenon 
is known from psychology as dissonance reduction [74,32]. (By the 
way, an attitude change does not occur in the case of a sufficiently high 
social reward for a behaviour x ' that is in disagreement with his/her 
attitudes!) 

• The individual can also look for a social environment S(x,t) which has 
a positive attitude towards his/her behaviour a: ‘Birds of a feather flock 
together.’ 

All three possibilities cause the consistency between behavioural type and per- 
sonal circumstances to increase. 

A possible means for the empirical determination of suitable subpopulations 
is, for example, the cluster analysis (cf. Ref. [149], pp. 532ff). The number of 




6.1. Terms 



125 





Figure 6.1: Stability of 
cognitive structures ac- 
cording to balance the- 
ory [119,120]: Depicted 
are the relations (atti- 
tudes) of two persons 
a and /? towards be- 
haviour x and the rela- 
tion of the persons among 
each other. Positive re- 
lations (e.g. likings) are 
represented by solid lines 
( — ) and negative rela- 
tions (e.g. dislikes) by 
dashed lines ( — ). Only 
structures with an odd 
number of positive re- 
lations are stable (con- 
sistent). Unstable struc- 
tures are denoted as un- 
balanced. If a struc- 
ture is unstable, it will, 
in the course of time, 
turn into a stable one. 
In this connection, the 
most probable transitions 
are those which require 
the least changes of re- 
lations (economy prin- 
ciple) [35,243]. These 
transitions are symbol- 
ized by arrows as far as 
they result in a change of 
the attitude towards be- 
haviour x. Another possi- 
bility would be a change 
of the relation between 
persons a and /?. 
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subpopulations which should be distinguished depends on the desired precision 
of the description. However, it is not reasonable to use a lot of subpopulations 
because, in such a way, the resulting model will become very complicated and 
susceptible to accidental disturbances. 



6.1.4 Socioconfiguration 

The socioconfiguration denotes the vector 

n :=(••., ) tr (6-4) 

which describes the behavioural distribution within the whole considered social 
system. n% means the number of individuals with behaviour x within subpo- 
pulation a. Within the total population the number n x of individuals with 
behaviour x is given by 

(6.5) 



Moreover, since the N a individuals of subpopulation a are distributed over all 
possible behaviours x, we have 




5>2 = w. 



( 6 . 6 ) 



6.1.5 Interaction 

In the context of the social sciences, with interactions we mean social in- 
teractions. These are characterized by an exchange of information. The 
decisive interactions are those which are related to a behavioural change. For 
example, in the case of opinion formation interactions are given by conversa- 
tions or discussions. Depending on the kind of opinion changes initialized by 
these we distinguish various kinds of interactions — imitative processes, avoi- 
dance processes, and compromising processes. 

Of special interest are non-linear interactions for which effects are not simply 
proportional to causes. So the behaviour of the social system cannot simply be 
understood as a sum of the behaviours of the single individuals. This is due to 
the fact that an individual often exerts an influence on the social system which, 
in turn, acts upon the individual (feedback effect). One possible consequence 
of this is the emergence of self-organization phenomena. 
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6.2 Problems with Modelling Social Processes 

6.2.1 Complexity 



The topic of social sciences are individuals and their behaviours as well as 
the social structures which result from these (groups, organizations, societies, 
etc.). From the point of view of natural sciences the behaviour of an individual 
is decisively determined by the processes in his/her brain. As a consequence, 
to model the behaviour in an exact way we seem to require an exact description 
of the processes within the brain. The functioning of the latter essentially bases 
on about 10 12 neurons (nerve cells) and about 10 15 synapses (connections). 
Consequently, the detailed dynamics of the brain is extremely complicated. 2 
However, the numerous variables of the brain are non-linearly coupled so that, 
according to Haken’s Slaving principle’ (cf. Ref. [106], pp. 187ff), most of 
the fast variables can be adiabatically eliminated. Thus we expect that the 
behavioural dynamics is determined by relatively few slowly changing variables. 

In order to motivate this phenomenologically we proceed as follows: The behavi- 
oural changes of an individual a are, on the one hand, given by (environmen- 
tal) influences Q a j and, on the other hand, by variables I a k which reflect the 
internal processes of his/her organism, especially of his/her brain. Some of 
the variables I a k are influenced by variables Q a j via perception. It will prove 
convenient to subdivide the variables {I a k} into variables {A a i} which change 
very slowly , variables {X am } which describe the (macroscopic) behaviour, 
and the remaining variables {O an }: 

I a = (A a ,X a ,O a ) tr . (6.7) 

Then the exact description of behavioural changes is given by the transition 
rates 



w a(Qcn X a , O a \Q a , X aj O a ; t ) — w(Q a , A a , X a , O a \Q ai A a , X ay O a ]t) 



( 6 . 8 ) 



which describe the change of the variables Q a) A a , X ai and O a into the varia- 
bles Q^, A 7 a , X f a , and 0' a . The dependence on the quasi- constant variables 
A! a = A a appears only in the dependence of the transition rates on a. 



The number of internal variables O an which are not at all of interest (as far 
as the description of social processes is concerned) is extraordinarily large. 
However, the situation simplifies since an organism follows the necessity to 
react to a change of the external situation Q a fast enough by changing his/her 
behaviour X a in an appropriate way. Therefore, in many cases behavioural 



2 Already in electroencephalograms (EEGs) one can detect chaotic time signals [14,15]. 
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changes are already determined by the external situation so that they can be 
described by transition rates of the form 

w a {Q' a ,X' a \Q a ,X a] t). (6.9) 

That is, the knowledge of the internal variables O a is not necessary. The reason 
for this is that the (microscopic) internal variables O a , which are to manage 
an adequate behavioural change, must vary much faster than the (macrosco- 
pic) situative variables Q a and the (macroscopic) behavioural variables X a . 
Therefore, due to their larger changing rate (i.e. the faster time scale on which 
they change), variables O a can be adiabatically eliminated (cf. Sect. 1.2.3). 
This implies an enormous data reduction and information compression 
for the model. 

In other words: Normally the internal variables O a are ‘slaved’ (determined) 
by the slow situative variables Q a in order to guarantee an appropriate re- 
sponse (reaction) X a to an external stimulus Q a (cf. Fig. 6.2). Neverthe- 
less, the reaction X a to a stimulus Q a is not necessarily unique (deterministic) 
but can be one of several possible behavioural alternatives. 

Stimulus Response 




Figure 6.2: Stimulus-response scheme of individual behaviour. 

Clearly speaking, for the modelling of behavioural changes it is not absolu- 
tely necessary to comprehend all the different thought processes (trains of 
thought) which lead to these in detail, provided that the thought processes 
are significantly faster than those behavioural changes which result from them. 
So behavioural changes can be understood as reactions occuring with certain 
probabilities to the perceptions (stimuli) that take place. 

The stimulus-response scheme is a result of long-term evolution proces- 
ses and short-term learning processes. Thus it is not guaranteed in new 
or special situations for which an appropriate reaction scheme first has to be 
learned , e.g. by trial and error. It also fails in the case of very fast changes of 
the situative variables. Furthermore, the elimination of internal variables O a is 
not admissible in the case of behavioural changes which are caused internally, 
e.g. by hunger or other physical needs. 

In the following we will mainly consider cases in which the situative variables 
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are given by the behaviours Xp of other individuals /?. Then there is 



Q a = (X u ..., X a - U X a+1 , X N ) tr (6.10) 



and 



w a (Q' a ,X' a \Q a ,X a -,t) = w a (X[,...,X , a ,...,X' N \X 1 ,...,X a ,...,X N ;t). 

( 6 . 11 ) 



6.2.2 Individuality 

The individuality of human beings results from their complexity which causes 
that there are no two persons anywhere who behave exactly in the same way. 
That is, for the number nx(Q) of individuals which are in situation Q and 
show behaviour X we have 

n x (Q) » 1 . (6.12) 

However, a verifiable or falsifiable model for behavioural changes presuppo- 
ses the reproducibility of the considered behavioural changes. This can be 
achieved by restriction to clearly delimited and simple situations. Then it is 
not important how the single individuals behave in other situations and areas 
of life. The modelled kinds of behaviour may even be influenced by other be- 
havioural areas as long as these influences have, on average of all individuals 
of one subpopulation, no systematic effect. 

Simple situations can be characterized by a few situative variables qj E {Qaj 1 } 
and they concern only a few behavioural variables x* E { X a k '}• So it is usually 
sufficient to distinguish a few behavioural types a. Consequently, simple 
situations evoke behavioural changes which can be described by transition rates 
of the form 

w a {q',x'\q,x-,t). (6.13) 

Then for the number n“(q) of individuals who belong to behavioural type a 
and show behaviour x in situation q we have 

K(q) > 1 • (6.14) 

Simple situations are, for example, situations in which the individuals decide 
on 



• taking a certain position concerning a given affair, 

• purchasing a product of a given kind of good, or 

• voting a certain party. 
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6.2.3 Stochasticity and Disturbances 

The necessity of a finally quantum mechanical treatment of the microscopic 
processes of an organism and the quasi- adiabatic elimination of fast variables 
O n have the consequence that behavioural changes are subject to fluctuati- 
ons which stem from the dynamics of the (microscopic) internal variables O n 
(cf. Sect. 1.2.3). Because of these internally caused fluctuations for behaviou- 
ral changes it is only transition probabilities that are given, i.e. a different 
individual behaviour can result even in seemingly identical situations q. Hence 
individual behaviour is unpredictable in detail 

However, it is possible to develop stochastic models, i.e. statements about 
the probability of certain behaviours. We can check the probability statements 
if there exists a sufficient number of individuals who possess the same beha- 
vioural type a, show initially the same behaviour x, and experience the same 
situation q. That is, we have a similar situation as in quantum mechanics where 
also only probability statements can be made. In quantum mechanics such pro- 
bability statements can be checked by an ensemble of systems which possess 
the same interaction type, are initially in the same state (by preparation), 
and undergo the same kind of interaction. 

Further reasons for the necessity of a stochastic treatment of social processes 
are 



• unknown situative influences (disturbances) Qj> £ {qj} (cf. Sect. 
1.2.2) and 

• small individual variations of the behaviour pattern within one and the 
same behavioural type a. (The way of proceeding described in Section 2.3 
is only approximately possible, then.) 

The difficulties with disturbances are related to the complexity of individuals. 



6.2.4 Decisions and Freedom of Decision-Making 

From a psychological point of view the fluctuations of the internal variables show 
up in the decisions of an individual which are decisive for his/her behaviour. 

According to Feger’s empirically founded model [68, 69] decisions are conflict 
situations which will occur if there can be chosen between several each other 
excluding behavioural alternatives x E {®i, . . . , a?s}. At the beginning of 
such a conflict one values its importance. This is decisive for the time spent 
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on decision-making. Afterwards, an accidental sequence of consequence an- 
ticipations starts, i.e. a sequence 

*o, *ij *2j • • • (6.15) 

of imagined consequences i n of the possible decisions each of which favours a 
certain alternative x n = f(i n ). In other words: The decision-maker becomes 
conscious of the consequences which the choice of the single available al- 
ternatives would bring about. The arguments considered on this occasion are 
of a more or less accidental order and let the decision-maker alternate bet- 
ween the different possible choices. A final decision will be made if k successive 
consequence anticipations (i.e. k arguments) 

*n+l , • • • , in+k with f(in+l) = • • • = /(*«+*) = * ( 6 - 16 ) 

are in favour of one and the same alternative x. Here the number k is the larger 
the more important the decision is. 

A comparison of the consequences of the different alternatives continues even 
after a decision was made. This is felt as doubts about the rightness of the 
decision made. 



The preferences for the single alternatives * affect the decision process in such 
a way that they determine the probabilities with which consequence anticipati- 
ons i n occur in favour of them. If the differences between the preferences of the 
considered alternatives are small (so that the choice of the best alternative and 
thus the decision is ‘difficult’) the sequence of consequence anticipations up to 
the final decision will be particularly long (since k is chosen especially large, 
then). A broad spectrum of alternatives has a similar consequences (because 
in this case the probability that successive arguments are in favour of one and 
the same alternative is small). 



The just described relations can be also formulated in a mathematical way. Let 
U a (x\y]t) be the preference for alternative x for an individual a who has 
finally made decision y. Let p a (x|y; t) > Obe the probability for the occurence 
of consequence anticipation i which, due to f(i) = x, is in favour of alternative 
x. It is plausible that this probability increases with the preference U a (x\y;t). 
Therefore and in order to guarantee the non-negativity of p a (x \y;t) we will 
assume 



Pa(*\v;t) 



U a (x\y;t) 

e 

y^ e Mg'|y;<) 



e 



(6.17a) 



X 1 



with the scaled preferences 



U a (x\y,t) := U a (x\y;t) - In ly,t) 

X 1 



(6.17b) 
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Then the probability p f a (x\y]t) that the next k arguments are all in favour of 
alternative x is 



Pa(*|y ;0 = [pa(*|y;<)] = e tc/ “ ( * |y, ‘ ) . (6.18) 

As a consequence, the probability p a {x\y]t) of a decision on alternative x is 
given by 



(6.19a) 



provided that the final decision is actually made. In the following the quantity 



(6.19b) 



U' a (x\v\t) ■= kU a (x\y;t) 

X 1 



kU Q (x\y,t) 



\Pa(x\y;t) := 



kU^x'^t) 



— e 



u' a (x\y,t) 



will be denoted as the attractiveness of decision x for an individual a who 
chose behaviour y before. According to (6.19a), the probability p a (x\y; t) of a 
decision on the alternative x with the largest preference U a {x\y\ t) will increase 
with k. 



Interestingly enough, (6.17a) leads, in contrast to other conceivable approaches, 
to the desired property that the probability distribution p a (x\y;t) of conse- 
quence anticipations and the probability distribution p a (a?|t/;£) of decisions, 
independently of Ar, possess the same functional form, namely an exponential 
dependence. Moreover, from 



U' a {xi\y,t) - U' a (x 2 \y,t) 



k 

k 



U a {xi\y,t) - f/ a (* 2 |y;<)] 

U a (xi\y,t) - U a (x 2 \y,t)^ 



( 6 . 20 ) 



follows the important relation that the attractiveness difference U f a (xi\y;t) — 
U f a (x 2 \y;t) between two alternatives x\ and x 2 is k times as large as the 
difference U a (xi\y; t) — U a (x 2 \y] t) of their preferences. The larger the number 
k of arguments which successively must be in favour of the same alternative 
x for a final decision on * the more evident become the differences between 
the preferences of similarly valued alternatives. Thus an increase of k (which 
implies a prolongation of the time spent on the decision) can compensate for a 
small difference between the preferences for different alternatives. 

Of special interest in connection with decision conflicts is also the quantity that 
is denoted as the freedom of decision-making. According to psychological 
investigations the (subjectively felt) freedom of decision-making increases with 
the uncertainty with respect to the final result of a decision [115,265]. That 
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is, the freedom of decision-making is an entropy-like quantity. It is the larger 
the larger the number of alternatives is and the more equivalent they are (with 
regard to their preferences). A restriction of the freedom of decision-making 
gives rise to reactance (i.e. to a kind of a defiant reaction): Either the 
limitation of freedom will be evaded if possible, or resistance is formed [31, 
30,32,299]. 

6.2.5 Experimental Problems 

Experiments in social sciences are especially difficult for various reasons: 



• Too many internal variables play a role most of which are unknown. 

• It is not easy to find comparable test persons for experiments. 

• Many disturbing influences make clear and reproducible results difficult. 



These problems and the possibilities to avoid them were already discussed in 
Sections 6.2.1 to 6. 2. 3. Furthermore, among other things, there are the following 
hindrances: 



• Experiments with many test persons require much effort and high expen- 
ditures. 

• In some cases experiments must be ruled out for ethical reasons. 



However, these problems do not affect so much the models of this book for 
which we can, for example, use empirical data from opinion polls or election 
and market analyses. 



6.2.6 Measurement of Behaviours 

A further difficulty in social sciences is the quantitative measurement of beha- 
viours. However, this is quite simple for the quantities which are needed for 
the description of pedestrian behaviour (cf. Refs. [123-125,129] and [132], 
pp. 232ff). These quantities are the locations, velocities, densities, routes, and 
demands (‘shopping lists’) of pedestrians. 

Already more difficult is the quantitative measurement of opinions. For 
this task the meanwhile very disseminated method of semantic differentials 
was developed by Osgood [211,212]. A sematic differential characterizes an 
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Figure 6.3: Semantic differentials for two similar terms (according to [142 ] l 
pp. 35ff; cf. also [141]). 
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opinion i by the associated ratings d}(i) E [1;7] on L seven-point rating 
scales / that are defined by polar terms (adjectives) (cf. Fig. 6.3). 

For two semantic differentials 

and (6.21) 

a measure of distance 






N 



£(Ai)-JfcO ) 2 



( 6 . 22 ) 



can be defined which is a measure for the dissimilarity of opinions i and 
j (cf. Ref. [212], pp. 85ff). Factor analyses (cf. Ref. [149], pp. 401ff) show 
that most opinions can already be well characterized by using co-ordinates of a 
three-dimensional so-called semantic space where the co-ordinate axes can be 
interpreted as evaluation, activity, and potency. The dimension of evalua- 
tion corresponds to polar terms like good/bad, kind/cruel, or beautiful/ugly, 
the dimension of potency to strong/ weak or hard/soft, and the dimension of 
activity to fast /slow or active/p assive (cf. Ref. [212], pp. 36ff). 

A related method for the analysis of opinions will be introduced in Section 11.6. 



6.3 Summary 

In the last sections it was shown how the stochastic methods that were dis- 
cussed in Part I can be made applicable to a quantitative description of social 
processes. A comparison of the essential aspects of stochastic models in physics 
and the social sciences is given in Table I. 

Whereas up to now we only provided a general modelling strategy and a suita- 
ble terminological frame ) in the following chapters concrete models for various 
social processes will be developed. In this connection we will consciously avoid 
a ‘physicalistic’ way of proceeding which would transfer existing physi- 
cal models without critical inspection to the description of social systems by 
simply reinterpreting the variables. Instead, models will be presented each of 
which is geared to the characteristics of the particular social system and takes 
into account its peculiarities in an adequate manner. 
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Aspect 


Physics 


Social Sciences 


Total System 


Gas, Fluid, etc. 


Social System 


Subsystems 


Particles 


Individuals 


State 


Location, Velocity, 
Energy Level, etc. 


Behaviour, Opinion, 
Strategy, etc. 


Type of 
Subsystem 


Sort of Particle, 
Kind of Interaction 


Subpopulation, 
Behavioural Type 


Vector of Occu- 
pation Numbers 


Configuration 


Socioconfiguration 


Transition 


State Change 


Behavioural Change 


Interactions 


Attractive and 
Repulsive Forces 


Imitative Processes, 
Avoidance Processes, 
Compromising Processes 


Experimental 

Verification 


Restriction to 
a Few Effects, 
Preparation of an 
Ensemble of Ident- 
ical Particles 


Restriction to 
Simple Situations, 
Selection of Various 
Individuals of the Same 
Behavioural Type 


Stochasticity/ 

Uncertainty 


Fluctuations 


‘Freedom of Decision- 
Making’ 



Table I: Specification and comparison of the essential terms used in stochastic 
models in physics and the social sciences. 




7 Decision Theoretical 
Specification of the Transition 
Rates 



7.1 Introduction 

The abstract mathematical formulation developed in Part I of this book obtains 
a physical, chemical, biological, or sociological meaning only after the transition 
rates have been concretely determined. Approaches for the transition rates 
can be found by plausibility considerations but, in the end, they must 
be empirically checked (cf. Chap. 11). Especially, in connection with social 
sciences the decision theoretical approach 

(*' 

with 



yields good results (cf. Refs. [289] and [294], p. 19). Here 

• v a (t) is a rate which describes the flexibility of individuals of subpopu- 
lation a with respect to behavioural changes, 

• R a (x'\x] t) is the readiness of individuals of behavioural type a for a 
change from behaviour x to during a time interval l/i/ a (£), 

• the distance D a (x\ x\ t) = D a (x,x' ;t) > 0 is a measure for the effort 

(7.2) 

of a behavioural change from a; to aj', 

• U a (Xj t ) is the utility of behaviour x for an individual of subpopulation a. 






#*) ( 7 - la ) 
(7.1b) 



The larger the flexibility v a (t) } the smaller the effort S a (x\x]t) y or the larger 
the utility increase t) — [/ a (®, t) for a behavioural change from behaviour 

x to behaviour x' the larger is the corresponding transition rate w a (x'\x;t). 
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We will investigate approach (7.1) from various perspectives, now. In particu- 
lar, in Section 7.2.1 we will find that (7.1) takes into account the uncertainty 
regarding the determination of the (scaled) conditional utility 

Ua{x'\x, t) := Ua(x',t) - U a (x,t) - S a (x f , z; 0 (7.3) 

of a behavioural change (i.e. the incompleteness of information during a 
decision). 

Finally, we will specify the transition rates w a b{x' , y'\x, y\ t) for pair inter- 
actions. These can be decomposed into several contributions due to different 
kinds of interactions, namely into imitative processes, avoidance processes, 
and compromising processes. 



7.2 Derivation 



Transition rates w a {x , \x\t) can always be written in the form 



(*' t x) 



with 



W a {x'\x-,t) —p a (x'\x;t) 



o 

IT 

IT 

IV 

o 


> 


p a (x\x]t) := 1 - Y Pa(* , |*;0 






X'&X) 



(7.4a) 

(7.4b) 



(cf. (1.22b)). Here At means a short time unit and p a (x f \x]t) the probability 
to carry out a behavioural change from x to x ' during this time unit. At should 
be so small that we have 



Y Pa{x'\x;t) < 1 or p a (x\x;t) « 1 (7.5) 

x'(^x) 

so that there will not be several transitions within a time interval At. 



Due to the non-negativity of p a (x f \x]t) the unique representation 

with 






f/ a( a;, | a; ;0 : = lnp a (a: , |a!;<) 



(7.6) 



is always possible. The exponential representation (7.6) has only implications 
with respect to the interpretation of the functions U f a (x'\x] t). Assumptions 
concerning the form of p a (x'\x ; t) are introduced later along with the specifica- 
tion of U' a (x'\x ; t) (cf. Sect. 7.2.4). In the following we will see that (7.6) can be 
motivated in several ways. One derivation was already provided in Section 6.2.4 
in connection with the mathematical formulation of Feger’s decision model. 
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7.2.1 The Multinomial Logit Modell 

Let 

(7.7) 

be the conditional utility which an individual of subpopulation a expects 
from a behavioural change from x to x f . Due to the incompleteness of 
information the conditional utility [/*(*' |x; t) consists of a known part 
U a {x f \x; t) and an unknown part e a (a5'|a5;£) which can only be estimated. 
For example, if (after having purchased product x) one decides on the purchase 
of a product x' the price will, as a known quantity, contribute to U a (x f \x; t) 
whereas the durability can only be estimated and will be taken into account 
by c a (* / | x\t). e a (x'\x]t) varies from individual a E a to individual /? E a and 
from decision to decision. 

It is plausible to assume that the decision-maker will choose just that behavi- 
oural change 

(a?'|a;) x ' < — x (7.8) 

from behaviour x to behaviour x ' for which the conditional utility U*(x f \x] t) 
is maximal, i.e. for which 

U:(x'\x;t)>U:(y\x;t) (7.9) 

holds in comparison with all other possible behavioural changes ( y\x ). Thus 
the probability to choose ( x'\x ) among all possible behavioural changes ( y\x ) 
is given by 

p a (x'\x;t) 

:= Prob^|c^ a (a! , |x; <) + e^as'la:; t) > U a (y\x; t) + € a (y|a;; £) Vy^x'}). 

(7.10) 

If the randomly varying part e a (x'\x;t) is independent of x ' and distributed 
according to the extreme value distribution 






(7.12) 
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(7.12) is a generalization of the multinomial logit model which, in its original 
version, does not take into account any dependence on the present behaviour x 
(cf. Refs. [53, 209]). h a (x,t) is a measure for the uncertainty of information 
during the decision (and will normally be independent of x). With the affine 
transformation 



U' a (x'\x;t) 



Ug(x'\x]t) — 

^ a (®>0 „ 



(7.13) 



we arrive at the desired representation 

p a (a;'|a:;<) = e t/,,(l!|x ’ t) . (7.14) 

Obviously, it is just the scaled conditional utility Ua(x*\x;t) which is the 
decisive quantity for behavioural changes. With the terminology of Section 
6.2.4, U' a (x f \x;t) can also be called the attractiveness of behaviour x f for an 
individual of subpopulation a who has chosen behaviour x before. 

The application of the extreme value distribution is particularly reasonable 
since the maximum of two extreme value distributed variables is also extreme 
value distributed. From this point of view, it is the c natural distribution’ for 
maximization problems [53]. If, instead of (7.11), we had applied the normal 
distribution (which is compared to the extreme value distribution in Figure 
7.1), probability (7.10) would have not been explicitly calculable. 



1.00 



0.75 



Figure 7.1: Comparison of £ °- 50 
the extreme value dis- 
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7.2.2 Entropy Maximization 

The distribution (7.12) resembles the canonical distribution 

—E'(i)/(kT') 
e 

^2 e -E'U)/(kT‘) 

j 

from thermodynamics [236,245] where k denotes the Boltzmann con- 
stant, T' the absolute temperature, and E'(i) the energy of state i. (7.15) 
can be derived by maximizing the (information) entropy 

S:=-^P(i)log 2 P(i) or S’ := - £P(i)lnP(i) = 51n2 (7.16) 

i i 

on the constraints 

^^P(i) = 1 (normalization), (7.17a) 

i 

^£ ,/ (z)P(z) = E 1 (conservation of total energy E') (7.17b) 

i 

(cf. Ref. [107], pp. 53ff). 

A similar derivation also exists for (7.12). It already became clear that, due to 
U*(x f \x] t) = U a (x'\x] t) + e a (* / |x; f), the maximization of the known condi- 
tional utility U a (x'\x ; t) does not necessarily lead to the best decision. Rather 
it is reasonable to occasionally vary behavioural changes in order to determine 
the actual conditional utility U*(x f \x;t) of other behavioural changes more 
precisely and, as a consequence, to be able to really optimize the behaviour. 
Hence the individual will undergo a certain risk but choose the probabilities 
p a (x'\x;t) of behavioural changes in such a way that the expected conditional 
utility reaches a certain percentage p of the maximal known conditional utility: 

(7.18) 

Since there exist many of such distributions, we will select the one which is 
‘most likely’, i.e. the distribution which maximizes the information entropy 

- X! pa ( y i* ;< ) lnpa ( ?, i a:; *) ( 7 - 19 ) 

y 

on the conditions (7.4b) and (7.18). Instead, however, we can also introduce 
the Lagrange multipliers A *(t) and p x a (t) and maximize the expression 

- ^Pa(i/l*;<) ln Pa(y|*;<) 
y 




(7.15) 
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- /£(<) 



^2 Ua(y\x ; t)p a (y\x; t) - h' a (x, t ) 
■ y 




l ) 



;<)-! 



(7.20) 



without considering the constraints. For this purpose we must derive (7.20) 
with respect to p a (x f \x;t) and set the resulting term to zero: 



- lnp a (x'|*; t) - 1 - [i*(t)U a (x'\x; t) - (\*(<) - 1 ) = 0 . (7.21) 



The solution of this equation is 



Pa{ x \ X 1 0 






(7.22) 



Now the Lagrange multiplier A*(tf) must be determined from the normaliza- 
tion condition. This leads to 



p a (x'\x;t) = 



s -rt(t)U a ( X '\x-,t) 

Z a (x,t ) 



with the so-called state sum (partition function) 



Z a (x,t) := 






(7.23a) 



(7.23b) 



Finally, the Lagrange multiplier p%{t) should be chosen in such a way that 
constraint (7.18) is fulfilled. This constraint is equivalent to the more familiar 
formulation 



5 In Z a 

9y x a (t) 



1 dZ a 
Z a 9y%(t) 



= Ki*, 1 ) 



(7.24) 



Denoting the value which fulfills (7.18) and (7.24) by /4(aj,£) and setting 



ha(®) i) 



1 

/40M) ’ 



(7.25) 



(7.23) again takes on the form (7.12). The further argumentation continues like 
in Section 7.2.1. 



7.2.3 Fechner’s Law 

Up to now the derivations of the probabilities of behavioural changes started 
from the subjective conditional utilities U a (x'\x\t). In this section we will 
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assume that Vaix'lx; t) > 0 reflects the objective advantage of a behavioural 
change from x to x' . Then the probabilities p a (x'\x\t ) of behavioural changes 
{x'\x) are expected to be proportional to the objective advantages V a {x , \x] t): 



(7.26) 



Pa(x |*; t ) — 



Vg(x'\ X]t) 



(law of relative effect [228,229,251]). According to Fechner’s law 
[67], however, the subjective perception Uaix'lx^i) of an objective quantity 
V a (x'\x]t) scales logarithmically: 

U a {x\x\t) - h a (x,t)\n V a (x'\x] t) + c a (x,t) 

(cf. also Ref. [21]). Inserting 

V a (x'\x-,t) = 



(7.27) 

(7.28) 



into (7.26) and using (7.13), we again obtain (7.6) (cf. Ref. [174]). 



7.2.4 Utility and Distance Function 



In the following the transition probability 

Pa(x'\x] t ) = e U ^ x (7.29) 

will be specified by a concrete approach for U^x'lx] t) (cf. (7.35)). With this 
aim in view we decompose the scaled conditional utility [/'(x'|a;;£) into an 
antisymmetrical part 




and a symmetrical part 




Then we can write 

U' a {x' \x;t) = A' a (x'\x;t) - S' a {x' ,x;t) 



(7.30a) 



(7.30b) 



(7.31) 



which is still a general relation. Both contributions A^x' \x\ t) and S f a (x f , x f \ t) 
have a clear interpretation: 
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• x; £) describes those contributions to transition probability 

p a (x'\x]t) which are independent of whether the behavioural change oc- 
curs from x to x' or vice versa, from x ' to x. Thus S f a (x f ,x;t) can be 
understood as a measure for the (subjective) effort which accompanies 
the behavioural change. The quantity 



(7.32) 



can be interpreted as a kind of distance (dissimilarity) between two 
behaviours and is a measure for the complications of a behavioural 
change or for the incompatibility of two behaviours. Depending on the 
situation it reflects the time needed, travelling expenses, removal costs, 
or transaction costs. 

The larger the distance D'^x', x ; t) or the effort S f a (x\ x; t ) is the smaller 
the corresponding transition probability p a {x , \x\t). 

• A f a (x f \x;t) describes those contributions to transition probability 
p a (x f \x]t) which depend on the direction : If A' a (x'\x; t) is large for a be- 
havioural change from x to x', the value A' a (x\x f ]t) = — A' a (x f \x; t) for 
the opposite behavioural change from x' to x will be small. So A' a {x'\x\ t ) 
takes into account those factors which are in favour of a certain behavi- 
our x' in comparison with another behaviour x . One often chooses the 
special approach 



D' a {x', 0 ; = e 5 * (r ,x,t) = D 'a{ x > t) > 0 



A a ( x |*)0 U a (x , t) U a (x,t) 



(7.33) 



where U' a (x y t) denotes the utility of behaviour *. Here the advantages 
of a behaviour yield a positive contribution to the utility whereas its 
disadvantages yield a negative one. 

The larger the utility difference 

:=U' a (x';t)-U' a (x,t) (7.34) 

is the higher the probability p a (x f \x;t) of a behavioural change. 



In summary, the scaled conditional utility [/'(x'l*; f) for a behavioural change 
from x to x ' is given by the utility difference A x > x U f a of the potential behaviour 
x' compared to the current behaviour x ) reduced by the effort S' a (x\ x;t) of 
the corresponding behavioural change. The resulting transition rates read 

/ f \ .\ 1 / / 1 _*\ 1 U‘(x'\x\t ) 

w a (x I *;<) = |*;0 = 



(7.35a) 
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with 



, n .s U’(x',t)-U'(x,t)-S'(x',x,t) e 

p a (x \x;t) = e a ^ ’ ' K ’ ’ ' = - 



K(*'s)-<(x,t) 



D' a (x',x;t) 



(7.35b) 



However, we are not at all interested in the ‘staying rate’ w a (x\x; t ) because it 
does not have any influence on the solution of the master equation (cf. (1.23)). 
Thus we often prefer a representation of the form 



w a (x'\x;t) = v a (t)- 



U a (x',t)-U a (x,t) 

D a (x',x-,t) 



which is connected with (7.35) via the relations 



v a {t) := 



1 



D a (t)At 



D a {x\x-,t) := 



D' a {x',x-,t) 

D a (t) 



and 



U a (x,t) ~U' a {x,t)-C a {t) 



(*' ^ *) (7.36a) 

(7.36b) 

(7.36c) 

(7.36d) 



In order to define the distance D a (x',x;t ) and the utility U a (x,t) in a unique 
way, we demand, for example, 



i^l/ a (x,0 = 0 



and 



S(S 



_ Da(x',X]t) — 1 

' x?x' 



(7.37a) 



(7.37b) 



where S means the number of states x E j a = {* 1 , • • • , * 5 }. Of course, S must 
be replaced by S a if the state spaces j a consist of different numbers S a of states. 
(7.37a) shifts the mean value 



Ca(t) :=4E^a(*.0 



(7.38a) 



of the (scaled) utilities U a (x,t) to zero whereas (7.37b), by 



(7.38b) 
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defines a ‘unit of distance’ D a (t) in which (scaled) distances D a {x' } x\t) 
are measured. v a {t) is a flexibility parameter that reflects the frequency 
of behavioural changes per time unit. It is the larger the smaller the unit of 
distance D a (t ) is. 

Since the scaling condition (7.37b) replaces the normalization condition (7.4b), 
approach (7.36) can manage without staying rates w a (x\x; t). Via relations 
(7.36) and (7.38) the flexibility parameter i/ a (£), distances D a (x / ,»;£), and 
utility functions U a {x ) t) can be determined from empirical data w e a (x'\x\ t) of 
transition rates (cf. Chap. 11). 



Finally, it is worth mentioning that approach (7.36) also describes transition 
rates that correspond to the Arrhenius formula (5.32): Applying the notations 
from Section 5.3 we find 





u\z) = -~4>(Z) 


(7.39a) 


and 


D\Z X , Z 0 ) = ■ 4? • 


(7.39b) 


7.3 


Pair Interaction Rates 




Up to now only the transition rates for spontaneous behavioural chan- 
ges have been treated. In this section we will specify the transition rates 
^ab(^ f j 0 which are related to pair interactions. A particularly simple 

approach would be 




W ab (x' ,y'\x,y;t) := v a i{t)p ai (x' ,y'\x,y,t) 


(7.40) 


with 


Pab(x', y'|*, y; 0 := p a b(x'\x; t)pba(y'\y ; t) . 


(7.41) 


Here 







• v a b(t) — NbV a b(t) is the meeting rate between an individual of subpo- 
pulation a and individuals of subpopulation b (which, by the way, reflects 
the social network; cf. [36,29]), 

• Pab(& f , 2/ / |*j 2/; 0 means the probability that, due to an interaction of 
two individuals of subpopulations a and 6, the former one changes the 
behaviour from * to x 1 and the latter one from y to t/, 
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• Pab(&' |*; 0 describes the probability that an individual of subpopulation 
a exchanges the behaviour x for x ' due to an interaction with an indivi- 
dual of subpopulation b. 



Especially, we have 

X 1 y ; 



^2p ab {x'\x;t) = 1 . 

x' 

Thus the effective transition rates 



w a (x'\x\ t ) = w a (x'\x; t) + ^2^2 ^2 w “ b ( x '’ y' l*> y> 0 A(y, <) 

6 V »' 



of the BoLTZMANN-like equations 



^P 0 (a:, <) = <) - u> a (a:'|x; 0^o(*> *)] 



(cf. (2.22)) take on the form 



w a {x'\x\ t) = w a (x' |x; <) + X^ v a>>{ t )P^i x '\ x ^ 0 

b 



(7.42a) 

(7.42b) 

(7.43a) 

(7.43b) 

(7.44) 



by applying (7.40), (7.41). That is, the effective transition rates lose their de- 
pendence on the behavioural distribution P&(y,<) so that the effects of pair 
interactions can no longer be mathematically distinguished from the effects of 
spontaneous transitions. (This implies that they can no longer be separately 
determined from empirical transition rates w*(x f \x] £); cf. Chap. 11.) The rea- 
son for this dilemma is factorization assumption (7.41) which supposes that 
interacting individuals carry out their behavioural changes in a statistically 
independent way. 



Therefore, we are looking for a more sophisticated approach than (7.41). Let 



x',y’ < — x,y 



(7.45) 



represent pair interactions during which the interacting individuals change their 
behaviours from x and y to x ' and y ' respectively. Then we can classify these 
according to the following scheme: 



:}<»> 



*, x 

*,y 



(7.46a) 
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*’* x,y i X i y \ 1 ( 1 ) (7.46b) 

y,y < — x,y (x f y) J v ’ 

x,tf < — x,x (y'^x) 1 

*',y < — y,y (*' ^y) > (2) (7.46c) 

< — as, * (*' #*,y'j£a;) J 

x,y' < — x,y (a^y.y'^y.y' £x) ] 

x',y < — x,y (x£y,x‘ £x,x' £y) > (3) (7.46d) 

x',y' < — x,y (a^y,a' #*,y'#y,a / #y,y'#a) J 

y,* < — x,y (*#y) ) 

x',x < — x,y (x £y,x' ^ x,x' ^ y) > (4) (7.46e) 

y.y 7 < — x ,y (x ^ y,y' ^ y,^ ^ x) j 



The classification (7.46) of transitions (7.45) is complete because there exist 
2 2 = 4 transitions of the kind 



x\y* < — x y x (7.47a) 

and 3 2 = 9 transitions of the kind 

2/' < — *,2/ (7.47b) 

In the first case x f and y f can be chosen equal or unequal to sb, in the second 
case equal to as, equal to y, or unequal to x and y. 

By means of curly brackets five kinds k E {0, ...,4} of pair interactions 
were distinguished in (7.46) which can obviously be interpreted as follows: 



(0) By (0) situations are summarized in which two each other meeting indi- 
viduals do not change their behaviour. Since they are not connected with 
any temporal change of the probabilities P a (x,t) or of the socioconfigu- 
ration n, they will not be treated any further. 

(1) Interactions (1) describe imitative processes or persuasion proces- 
ses, i.e. the tendency of an individual to take over the behaviour of ano- 
ther individual (cf. Ref. [77], pp. 45 Iff). 

(2) Interactions (2) reflect avoidance processes where an individual gives 
up his/her behaviour in favour of another behaviour if meeting an indi- 
vidual with the same behaviour. Such aversive reactions show up, for 
example, in defiance or snobbish behaviour. 
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(3) Interactions (3) delineate compromising processes (in a broader sense) 
where an individual changes his/her behaviour for a new behaviour 
if meeting an individual with another behaviour. One can find such beha- 
vioural changes in situations where the own behaviour cannot be main- 
tained when confronted with another behaviour which is, however, also 
not convincing. 

In the narrow sense, with a compromising process we mean the special 
case 

x',x'< — x,y (x ^y,x' ^x,x' ^y) (7.48) 

of 

x\y'i — x,y (x^y,x' ^x,y' ^y,x' ^y,y' ^x) (7.49) 

where an individual only changes the behaviour in favour of another one 
if he/she can agree on this with his/her interaction partner. Thus we will 
call (7.48) an agreement process. 

(4) Interactions (4), like (1), describe persuasion processes. The difference, 
however, is that one of the individuals changes the behaviour though 
he/she could persuade the other one of his/her previous behaviour. Such 
processes occur only very rarely, i.e. they are very improbable. Conse- 
quently, they will be excluded from our further considerations. 



Instead of (7.41), we can now set up the following more sophisticated approach: 





pij,(*'l*; 0 • 1 = pLC*'!*; 0 if = y and y' = y 




ip»a(i/ , li/;0 = P6a(y , |y; < ) 


if y* = x and x 1 — x 


Pab(*', y'\x,y;t) := < 


o 

II 

o 

j£ 

-o 
tp a 

ft- 


if x' = y and y' ^ y 




o • pLiv'ly’ 0 = 0 


if y ; = x and x 1 ^ x 




pSt(* / l*; < )p» a (i/ , li/; < ) 


otherwise. 



(7.50) 



With this approach we give up the statistical independence of formulation 
(7.41) because the transition probabilities depend on the kind k of the pair 
interaction. This is a function of the behaviours shown by two individuals at 
the beginning and the end of their interaction: 

k = k(x',y'-,x,y). (7.51) 



Apart from this we assumed for an individual who persuades another individual 
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that he/she will keep the behaviour with probability 1 (confer first and second 
line). That is, he/she changes the behaviour with probability 0 (confer third 
and fourth line). 

From (7.46) and (7.50) we obtain the following transition rates for imitative 
processes, avoidance processes, and compromising processes: 

w abi x iV \ x iV^) = v a b{t)pl a {x\y] t)8 xx i8 x yi{l — 6 X y) 



“b Vab(t)Pab(y\&i O^yy'^y*' (1 ” ^*y ) > (7.52a) 

w ab( x >1/ I = Vab{t)pib{ x \ x: >t)Pba{y |*J ~ 5 xy ') 

+ ~ &y*') 

+ ^ad(0p26(* / |*;0Pda(l/l*>0^y( 1 - 

x(l -Sty*), (7.52b) 



wlbi?' iy'\ x >yi 0 

= ^(OPak^l^jOpL^lwO^K 1 “ ^y)(! - Syy'X 1 - 5 *y') 
+ ^(OPadC^kiOpLCylVJO^yy'C 1 “ kyX 1 - ^'X 1 “ <W) 

+ v a b{t)p a b{x l*iOP&a(2/ l2/>0(^ ~ ^»y)(l ~ ^ara; / )(l ~ ^yy') 



X ( 1 $yx' ) • 



(7.52c) 



The factors S xy stem from constraints of the kind x = y within classification 
(7.46), the factors (1 — £ xy ), in contrast, from constraints of the type x ^ y. 

Inserting the total transition rate 



(7.53) 



into the general BoLTZMANN-like equations 

^ Pa(x,t ) = Y2 [w a (x\x'-,t)P a (x',t)~ w a (x'\x;t)P a (x,t)] 

X 1 

+ Y2 ]L ]C w «‘( x > t/V> y; 0-P»(y» O-faC*'. 0 

b x 1 y y 1 

~ 12^2 J212 Wai ( x> ' y'\ x >y> 0A(y , <)*«(*> *) 

b x' y y' 



Wah(x',l/\x,v,t) := w k ai {x\ y'\x, y; t) 

k 



(7.54) 
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(cf. (7.43)) and using (7.52), (7.42) we find the explicit formula 



(7.55) 



Despite the differences between formulations (7.50) and (7.41) it turns out that 
for the special case 

P k ai{ x '\ x ^) =Pab(x'\x-,t) (7.56) 

(7.55) again leads to the BoLTZMANN-like equations (7.43b) with the special 
transition rates (7.44). These differences appear, however, in the master equa- 
tion (3.50), the covariance equations (3.22), and even in the corrected mean 
value equations (3.47). 

In the following, for p* b (x f \x;t) with x* ^ x, we will assume a relation of the 
kind 

(7.57a) 

As a consequence, for the staying probability p k ab (x\x\ t) this implies 

p k ai (x\x; <) = 1- f k ai (t) £ R a (x'\x; t) . (7.57b) 

x'(^x) 

Here 

• fa b (t) is a measure for the frequency of interactions of kind k for 
individuals of subpopulation a in interactions with individuals of subpo- 
pulation 6, 

• R a (x f \x]t) is a measure for the readiness of an individual of subpo- 
pulation a to change from behaviour x to x' in interactions with other 
individuals. 
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In order to arrive at (7.57) we choose for p* 6 (a % J\x\t) an approach analogous to 
that of Section 7.2.4: 



p k ab (x'\x;t) 



j UZ b (*',t)-U k ab (T,t) 

D k ab (t ) D k ab (x',x;t) 



(*' ? *) 



(7.58a) 



with 



D a bi x i x >0 — D a b(x , : 



>o. 



(7.58b) 



The quantities U* b (x y t) have also the meaning of (scaled) utility functions 
and the quantities D* b (x' y x\t) the interpretation of (scaled) distance func- 
tions, however, this time for behavioural changes which are induced by interac- 
tions of kind k with individuals of subpopulation b. For reasons of simplicity, in 
the following we will assume that for the scaled utility and the scaled distance 
it is neither of importance to which subpopulation b an interaction partner 
belongs nor which kind k of interaction occurs: 



U k b (x,t) := U a (x,t), 
D ab {x ,x',t) := D (x ,x]t ) 



(7.59a) 

(7.59b) 



Thus the scaled utility and distance functions depend only on the acting indivi- 
dual. This assumption is not always justified but it can be dropped if required. 
In any case, assumption (7.59) still admits that the utility functions U a {x ) t) 
and distance functions D a (x , y x\t) which describe behavioural changes due to 
pair interactions differ from the utility functions U a (&,t) and distance functions 
D a (x' y x;t) which concern spontaneous behavioural changes. The dependence 
of the transition probabilities p* b (x f \x\t) on the subpopulation b to which an 
interaction partner belongs and on the kind k of interaction is described by the 
‘units of distance 9 D* b (t). These determine, by 




(7.60) 



the frequencies with which individuals of subpopulation a show behavioural 
changes of kind k in interactions with individuals of subpopulation 6. For rea- 
sons of uniqueness, we again introduce constraints of the form 



and 



X 



(7.61a) 



1 

5(5-1) 



x^x' 



(7.61b) 
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Now, with the definition 



i2 a (a?'| x;t) := 



U a {x , ,t)-U a (x y t) 

D a (x',x;t) 



(*' # *) > 



(7.62a) 



we obtain the desired relation (7.57) which simplifies the BoLTZMANN-like equa- 
tions (7.55) considerably. After introducing the abbreviations 






and 



(7.62b) 



v a (x',x;t) := 

+ (*&(0 - *&(*)) p t >( x ’ *) + *&(*)] > 
equations (7.55) take on the clear form 



— P a (*, t) = ^2 ™ a (*|«'; t)Pa{*\ 0 - w a (x f \x] t)P a (x, t) 



(cf. (7.43)) with the effective transition rates 



w a (x'\x ; t) = w a (x , \x; t) 4- i/ a (a5 ; , *; t)R a (x f \x]i) 



(7.62c) 



(7.62d) 



(x f ^ x) . (7.62e) 



Here 



• i/ a (* / , x\ t) are rates which are a measure for the total frequency of inter- 
actions of an individual of subpopulation a with other individuals, 

• u ab(^) * s the ra ^ e interactions of kind k between an individual of 
subpopulation a and individuals of subpopulation 6. 

By assumptions (7.50), (7.57), and (7.62a) (which can be dropped if requi- 
red) the general BoLTZMANN-like model (7.43) was considerably simplified and 
specified. Nevertheless, the resulting model is still able to describe the essen- 
tial characteristics of imitative, avoidance, and compromising processes (cf. 
Chap. 8). Moreover, it contains a number of established models as special ca- 
ses. 
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7.3.1 Special Applications in the Social Sciences 

In the following, in order to illustrate the versatility and explanatory power of 
the BoLTZMANN-like equations (7.62), we will discuss some special cases which 
are well-founded and widely disseminated in the social, regional and economic 
sciences. Many of these special cases are empirically validated so that probably 
also the BoLTZMANN-like model in its general form (7.62) will be proved. 



• The logistic model [217,281] describes limited growth processes. 
Examples of applications are the growth of towns as well as the diffusion 
of information or innovations [18, 110, 187, 206] (cf. also p. 2). 

In order to derive the logistic model let us distinguish two behaviours 
x G {1,2} and one subpopulation a only. Then there is A = 1 and 
Pi(l,£) = 1 — Pi(2,£). x = 2 could have the meaning to apply a certain 
strategy (e.g. the newest technology) whereas x — 1 would then mean 
not to do so. If only imitative processes 

2, 2 < — 1,2 (7.63) 

and spontaneous decay processes 

1 <— 2 (7.64) 

occur, we finally find the logistic equation 



|Pi(2,0 = -u; 1 (l|2;0Pi(2,<) + ^ 1 i(<)« 1 (2|l;0[l-A(2,<)]i 3 i(2,0 



= j 4(<)Pi(2,<)[s(<)-7 , i(2,<)] 


(7.65) 


(cf. (2.66), (8.67)) with 




A(<) := i/ 1 1 1 (0P 1 (2|l;<) 


(7.66) 


and 




RM.-1 «»i(l|2;*) 

U ' ^(0^(211; <)' 


(7.67) 



• The gravity model delineates exchange processes between different 
locations a;. It is used to describe e.g. good exchange, migration, or traffic 
flows between various towns [235,305] (cf. also p. 3). 

The gravity model results for w a (x f \x; t) = 0, v\ h { t ) = 1, v% b (t) = 0 = 
i^(£), and A = 1. Then, for the flow of exchange w 1 (x , \x;t)Pi(x i t) 
from x to x' y we get 



w 1 (x'\x,t)Pi(x,t) = vh(t)e 



try ,«)-t/y«) Pi(x',t)Pi(x, t ) 
D 1 (x',x;t) 



(7.68) 
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This is, on the one hand, proportional to the probability P\(x,t) which 
is a measure for the number of individuals who are at place x and 
might be interested in an exchange with another place. On the other 
hand, it is proportional to the probability P\ (x\t) which is a measure 
for the number of potential exchange partners at place x ' . Apart from 
this, the exchange rate is reciprocally proportional to the transaction 
costs D l (X) x '\ £), i.e. to the costs of exchange. D 1 (x i x*\ t) is often writ- 
ten as a function of the metrical distance ||se — *'|| between x and x f : 
D 1 (x, x'\ t) = D 1 ( \\x - x'||). 

In the original version of the gravity model the term exp[f/ 1 (x / ,<) — 
U 1 (x J t)] does not appear which corresponds to the case U 1 (x ) t) = const . 
If this case does not apply, it will be reasonable to denote (7.68) as the 
generalized gravity model. For the latter we obtain the dynamical 
equation 

± = „(o y _ /c.o-t'OM)] p(*,<)p(*'.o 

dt K v } V L D(x,x‘;t) 

(7.69) 

where, for reasons of clarity, all subscripts and superscripts were omitted. 

• The game dynamical equations [137, 271, 303, 139] model the compe- 
tition or cooperation of individuals. They belong to the most power- 
ful mathematical models in the social and economic sciences. The game 
dynamical equations also play a key role in the theory of evolution 
where they are known as the mutation- selection equations or Eigen- 
Fisher equations [60,78]. Their derivation and detailed discussion is 
the topic of Chapter 10 (cf. also pp. 3ff). 



7.4 Properties of the Utility Approach 

7.4.1 Stationary Distribution 

Neglecting pair interactions ( w a b = 0) for the time being and inserting (7.1) 
into (7.43) we have, for time-independent transition rates, the master equation 



-p„(*,o=e p.wa-* D ' (t . tX) • 



(7.70) 
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Due to D a (x,x f ) = D a (x\x) its stationary solution is obviously 



*?(«) 



2 UJx) 

e 

X 1 



(7.71) 



and fulfills condition (1.118) of detailed balance. As a solution of the cor- 
responding configurational master equation (3.50) we find, according to 
(3.3), 

A 

P(n,t) = l[P a (n a ,t) (7.72a) 

a= 1 



with 




(7.72b) 

In the first moment it seems that a similar solution as for (7.70) can also be 
found for imitative processes: The corresponding BoLTZMANN-like equations 




(x,t) = 

x> b 






I "at 



y ab~ 



D a (x , X*) 
U a (x’)-U a (x) 

D a (x f , x) 



/\ t)Pa{ x j 0 



-P b (x',t)P a (x,t) 



(7.73) 



let us expect the stationary solution (as) with 

= e U W- U 'Vp°(x)'£ V l t P!(x') 



Indeed, the approach 



P° a (x) ^Ma^Y^vlP^x) 

b 



Va,aj. 

(7.74) 

(7.75) 



with the normalization constant N a satisfies condition (7.74). However, at the 
same time it is a system of linear equations which can be solved only on 
certain conditions (cf. Ref. [20], pp. 120ff) which are usually not fulfilled. 
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7.4.2 Contributions to the Utility Function 

The utility function can be often split up into several contributions of different 
meaning (cf. also Sect. 11.6): 



U a (x,t) = J2 C '° U ° (*’*) 

l 



(7.76) 



Here the functions U l a (x,t) are denoted as explanatory variables. If U l a (x,t) 
reflect the contributions of objective advantages V^x^t) > 0, due to 
Fechner’s law they will enter in the utility function logarithmically (cf. 
Sect. 7.2.3): 



[/'(*,<) = h'jn Vi(x,t) + In C’ a = In 




(7.77) 



Then (7.76) can be represented in the Cobb-Douglas form [55,199] 




(7.78) 



in which the quantities V*(x,t) enter via power laws. 




8 Opinion Formation Models 



8.1 Introduction 

Since the expression of an opinion represents a behaviour, the change of (ex- 
pressed) opinions is a suitable example for behavioural changes, particularly 
because, by means of opinion polls, a direct comparison with empirical data 
is possible (cf. Chap. 11). The laws of opinion formation have already long 
been the topic of social research. For example, Osgood and Tannenbatjm 
with their congruity principle [213], Heider with his balance theory [119, 
120], and Festinger with his dissonance theory [74] (cf. Fig. 6.1) developed 
formal models which deal with the stability of attitudes and with the probabi- 
lity of certain attitude changes. An overview of the state of related research is 
given in References [77,220]. 

In contrast to the above mentioned models, in this chapter we will treat mo- 
dels which describe the temporal change of the proportion P(i, t) of individuals 
who show a certain behaviour i. Such models are of big interest for a quanti- 
tative understanding of opinion formation processes and for the prognosis 
of trends in public opinion (e.g. of tendencies concerning the attitudes of 
voters) . 

Coleman’s diffusion model was one of the first mathematical models of this 
kind. It distinguished two types of opinion changes — opinion changes by direct 
interactions and opinion changes which are induced by external influences. Sto- 
chastic models for the diffusion of rumours and news were presented by 
Bartholomew [18]. In the literature one can find still many more diffusion 
models [50, 98, 110, 157, 178, 247]. 

The most advanced model stems from Weidlich and Haag [292,289, 285, 286, 
294]. It was taken up by several authors in one or another modification [65, 257, 
274]. Weidlich and Haag’s model bases on a stochastic formulation with the 
master equation and assumes that, independently of each other, individuals 
exchange their opinions for other ones. This happens with a certain transition 
rate (transition probability per time unit) which depends on the frequencies of 
the different expressed opinions. In other words, the approach of WEIDLICH and 
Haag assumes indirect interactions between individuals. These can best be 
imagined to be induced by the mass media (i.e. TV, radio, newspapers, or 
magazines) (cf. Fig. 8.1). 

However, opinion changes are also possible by means of direct interac- 
tions which play a particularly important role in conversations or discussions 
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Figure 8.1: Schematic 

representation of the 2 N 
indirect interactions be- 
tween N = 3 individu- 
als. Indirect interactions 
are induced by a mean 
field which describes the 
effects of public opinion 
and the media. 



mean 

field 



(cf. Fig. 8.2). In this connection we will restrict ourselves to direct pair 
interactions since they normally yield the most significant contribution. 



Figure 8.2: Schematic re- 
presentation of the N\ di- 
rect interactions between 
N = 3 individuals. Di- 
rect interactions domi- 
nate in conversations or 
discussions. 




o 




o 
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Their mathematical description bases on the BoLTZMANN-like equations (cf. 
Sect. 7.3) which also include the effects of spontaneous behavioural changes. 

According to Section 7.3, three kinds of direct pair interactions can be 
distinguished — imitative processes (persuasion processes), avoidance 
processes, and compromising processes. All three kinds of interactions 
can, in the course of time, lead to stationary proportions of opinions. However, 
there can also occur more complex patterns of opinion changes as, for example, 
oscillatory or chaotic opinion changes. 

Some representative cases will be illustrated by means of computer simulations. 
By dividing the total population into different subpopulations several beha- 
vioural types a are taken into account. The interaction frequency between 
the individuals of different subpopulations is given by their likings. 

Of course, the models for direct and indirect interactions can be integrated in 
a general model (cf. Fig. 8.3). Moreover, a treatment of the spatial diffusion 
of opinions is also possible. 




Figure 8.3: Direct and in- 
direct interactions of indi- 
viduals can be described 
together by one general 
mathematical model. 



8.2 Indirect Interactions 



In the following we will briefly introduce the Weidlich-Haag model for 
spontaneous behavioural changes due to indirect interactions (cf. Ref. [292, 
289] and p. 11). In its most general form it was developed for migration 
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processes (removal processes) [288,294] but it can be also applied to the 
development of towns [293,194,295] or to opinion formation processes 
[285,286,292,290]. 

The behavioural model of Weidlich and Haag results for w a t, = 0 and bases 
on the master equation 



^P(n, t) = w{n\n'\ t)P(n f , t) — w(n f \n; t)P(n, t) 



( 8 . 1 ) 



with the configurational transition rates 



w(n'\n] t) 



w a (j\i; n; t)nf if n' = »£ 
0 otherwise 



(8.2a) 



(cf. (3.50)). Here we have utilized x = i since, by the states sb, we only distin- 
guish opinions i E {1, . . . , 5). 

One speaks of indirect interactions due to the dependence of the individual 
transition rates 









(8.2b) 



and the utilities U a (i, n; t) on the so cio configuration n. The corresponding ap- 
proximate mean value equations (3.16) read 



d(n?) 

dt 



X >(^;0 

3 



■ U a (i,(n))-U.U , <»)), a . 

e ( n j ) 







( 8 . 3 ) 

Since these are non-linear they are expected to display a complex temporal 
evolution (cf. Sects. 5.4 and 5.5). 

For (8.2b) the Weidlich-Haag model usually assumes the relations 



"a(i,j;0 = v a {j,v,t) 



(8.4a) 



and 



(8.4b) 
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v a (j, denotes the flexibility or mobility of subpopulation a with respect 
to a transition from i to j . £ a (i) is the inclination 1 of individuals of subpo- 
pulation a towards opinion i. K a b are coupling parameters which describe 
the influence of the behavioural distribution within subpopulation b on the be- 
havioural changes of subpopulation a. Sometimes K a b can be also interpreted 
as agglomeration parameters (cf. Ref. [293]). For /c a & > 0 the quantity 
reflects the social pressure which is exerted by the behavioural majority. Fi- 
nally, the coefficients a a bc have the meaning of saturation parameters. In 
the following we will assume the common case 

v a (j,r,t) = v a (t) = v (8.5) 



and 

d a bc = 0 ( 8 . 6 ) 

if nothing else is mentioned . 



8.2.1 A Period Doubling Route to Chaos 

An interesting case is found for A = 2 subpopulations and 5 = 3 different 
opinions i G {1,2, 3}. Namely, for 

*.(0 = 0, («„») := (* _ }' 5 ) , (8.7) 

along with a variation of the control parameter k £ [0,1.5], we traverse 
a period doubling scenario which leads to chaos (cf. Ref. [237]). A more 
detailed description of this scenario is given in Section 5.5.2. 



8.2.2 A Ruelle-Takens-Newhouse Route to Chaos 



Now let us again assume 5 = 3 different possible opinions i but A = 3 subpo- 
pulations. Then, for 



^a(*) — 0 > ’ 



/ 1.7 1.5 -1.5 

-1.5 1.7 1.5 

\ -k -1.5 1.7 



( 8 . 8 ) 



we pass through the Ruelle-Takens-Newhouse scenario along with a va- 
riation of the control parameter k £ [—1.5, 1.5] (cf. Sect. 5.5.1). For increasing 
values of k as attractor we subsequently find a limit cycle, a torus, and a 
strange attractor (cf. Ref. [103]). 



1 In the work of Weidlich and Haag this quantity is denoted as preference. However, in 
order to distinguish 8 a (x, t) from U a (x\y; t) the author decided to introduce another term. 
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8.3 Direct Pair Interactions 



In conversations or discussions opinion changes occur due to direct inter- 
actions (cf. Ref. [77], pp. 451ff, and [220]). The following considerations are 
restricted to pair interactions (dyadic interactions) since they normally 
play the most significant role. Even in groups the most frequent interactions 
are alternating pair interactions — not always but in many cases. In situations 
where simultaneous interactions between more than two persons are essential, 
a generalization is necessary. This is possible with the methods presented in 
Section 3.4. 

Pair interactions are described by the configurational master equation (8.1) 
with the special transition rates 



w(n f \n;t) := 



j'\i, j; t)n“n) if n' = »?,),?) 
0 otherwise 



(8.9) 



(cf. Sect. 3.2.2). The approximate mean value equations of this master equation 
are, according to (7.54), the Boltzmann equations 



b i> j y 

+ X) i'l*. i; tWh t)Pb(j, t ) 

t «•' j j' 



( 8 . 10 ) 



(w a i = NbW a j). Because of 

Pa(i,t)> 0 and ^P a (i,f) = l, (8.11) 

i 



the quantity P a (i,i) = ( n i)t/N a can be interpreted as the proportion (frac- 
tion, share) of individuals of subpopulation a with opinion i or as the pro- 
bability with which a randomly selected individual of subpopulation a holds 
opinion i. Among the total population the proportion P(i, t) of individuals with 
opinion i is given by 




( 8 . 12 ) 



where N denotes the size of the total population and N a the number of indivi- 
duals of behavioural type a. 
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8.3.1 Kinds of Pair Interactions 

As derived in Section 7.3, three kinds of direct pair interactions can be distin- 
guished: 

1. Imitative processes (persuasion processes) which describe the ten- 
dency to take over the opinion of another individual. 

2. Avoidance processes which reflect an aversive behaviour where an in- 
dividual changes his/her opinion when meeting another individual with 
the same opinion (defiance behaviour, snob effect). 

3. Compromising processes which describe the readiness of an individual 
to accept a new opinion if the old opinion cannot be maintained when 
confronted with the opinion of another individual whose opinion does not 
persuade either. 



These kinds of pair interactions can be described by transition rates (7.52) 
together with (7.57). The corresponding Boltzmann equations read 




( M ) = ^2 [« , a (*| t , ; 0 - p a (* / .0 

«•' 



- w a (i'\i-,t)P a (i,t) 



with the effective transition rates 



w a (i‘\i;t) := i/ a (i* ,i;t)R a (i f \i\t) 



(*V0 



(8.13a) 



(8.13b) 



and 



0 : =£[( v 2»(0- v irt(0)A(* , ,0+ + *'.»(<)] • 

b 

(8.13c) 



• iA| 6 (tf)P&(i',2) describes persuasion processes by individuals of subpopu- 
lation b with opinion i f . 

• reflects avoidance processes with respect to individuals of 
subpopulation b holding opinion i. 

• u ab(t)[l “ A(* ; > 0 — Pb(h f)l delineates compromising processes during the 
remaining interactions with individuals of subpopulation b. 
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In the following computer simulations we choose for the readiness R a (i f \i] t) 
to exchange opinion i for opinion V due to a pair interaction the approach that 
was motivated in Section 7.3: 



U a (i',t)-U a (i,t ) 

*■<'1 i; "=W (814) 

with D a (i ' , i ; t) = 1. In addition, we assume the case of S = 3 different opinions 
i G {1, 2, 3}. The latter can be understood as a prototype of opinion formation 
situations because the opinions concerning a given affair can be classified ac- 
cording to the schemes ‘pro’, ‘contra’, ‘indifferent’, or ‘positive’, ‘negative’, 
‘neutral’. Furthermore, two subpopulations a G {1,2} are distinguished which 



a) either have the same preferences (U 1 ( 1,£) = (7 2 (l,f) = c, U 1 (2 1 t) = 
U\2,t) = 0, 17 1 (3,0 = Cf 2 (3,t) = -c) 

b) or different preferences (U 1 ( l,t) = U 2 {2,t) = c, U 1 (2 1 t) = U 2 (l,t) = 0, 
(7 1 (3,t) = C/ 2 (3,t) = — c). 



In the following figures the opinion with the largest preference (U a (i, t) = c) 
is always represented by a solid line ( — ), the opinion with medium preference 
(U a (i, t) = 0) by a dashed line ( — ), and the opinion with the lowest preference 
([/ a (i, t) = — c) by a dotted line (• • •). 



Persuasion Processes = 1, ^(0 = 0 = ^(0) c = 



If both subpopulations prefer the same opinion, then, as expected, only the 
opinion with the largest preference (i = 1) will survive (cf. Fig. 8.4a). However, 
if the subpopulations prefer different opinions (one of them i = 1 , the other 
one i = 2), a certain proportion of individuals will become persuaded of that 
opinion which is preferred by the other subpopulation (cf. Fig. 8.4b). 



Avoidance Processes (^(0 — 1) u lb(t) = 0 = c ~ 1) : 



If both subpopulations prefer the same opinion, avoidance processes in favour 
of less preferred opinions will frequently occur (cf. Fig. 8.5a). However, the 
preferred opinion will gain a significantly higher proportion within a subpo- 
pulation if the other subpopulation favours a different opinion (cf. Fig. 8.5b). 
Avoidance processes are only rarely necessary, then. 
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t 

Figure 8.4a: Persuasion processes for equal preferences: The preferred opinion 
( — ) wins out. 




t 

Figure 8.4b: Persuasion processes for different preferences: Some individuals 
become persuaded of the opinion which is preferred by the other subpopulation 
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t 

Figure 8.5a: Avoidance processes for equal preferences have the consequence 
that the favoured opinion is held only by a relatively small proportion of indi- 
viduals. 




t 



Figure 8.5b: In the case of different preferences avoidance processes occur rarely 
and their effect is relatively weak. 
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Compromising and Persuasion Processes M*) = i = KM, 
Kb( t ) = °’ c = 0.5): 

The preferred opinion will, as expected, completely win out if both subpo- 
pulations agree upon it (cf. Fig. 8.6a). If, however, the subpopulations prefer 
different opinions, the compromising processes will become relevant: Even an 
opinion which is not preferred by any of the subpopulations (i = 3) will be 
chosen by a certain proportion of individuals (•••), then (cf. Fig. 8.6b). 



8.3.2 Oscillations 



Opinion changes due to persuasion processes can also occur in an oscillatory 
way. In order to prove this let us, first, consider the case in which the sub- 
populations do not influence each other (z/J 6 (<) := v aa^ab)- The corresponding 
Boltzmann equations are obviously 



(8.15a) 



with the antisymmetrical matrix 

(8.15b) 

Equations (8.15) have at least the S stationary solutions P l a {i) with 



/e (8.16) 



We will now carry out a linear stability analysis, i.e. a first order Taylor 
expansion of equations (8.15) around each stationary solution Ph(i): The terms 
of quadratic and higher orders in 

6Pi(i,t):=P a (i,t)-Pi(i) (8.17a) 

can be neglected in comparison with the linear terms if the deviations SP^(i } t) 
from the stationary solution P l a (i) are sufficiently small which implies 

«Pj(i,0«l. (8.17b) 




E a = (£‘ ) := - # a 0'|i)) = 



j t P a (i,t) = ^ aa P a (i,t)J^Et j P a (j,t) 
j 



This procedure leads to the equations 




j -EfiSPfat) if*#/ 
\ ^E^6PiU,t) if i = l 



(8.18) 



where 6P l a (i,t ) > 0 for i # / and 6P l a (l,t) = — ^ 8P l a (i,t) < 0. 

‘‘(*0 
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t 



Figure 8.6a: Compromising and persuasion processes for equal preferences: The 
readiness for compromises has only a small relevance (cf. Fig. 8.4a). 




t 



Figure 8.6b: Compromising processes and persuasion processes for different 
preferences: A certain proportion of individuals decides in favour of a compro- 
mise (• • •). 
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Consequently, the stationary solution P l a (i ) is stable on the condition 

Vi(# 0- 



E?i > 0 



i.e. 



R a (l\i) > R a (i\l) 



(8.19) 



Otherwise it is unstable. Due to Eft = —Ef t maximally one of the solutions 
Ptti) can be stable. If such a stable solution exists, only one opinion can survive 
because of P l a (i) = <5,/. 

If none of the solutions P l a (i) is stable (which, by the way, cannot happen for 
the approach (8.14)), a further stationary solution P°(i) might exist which is 
defined by 






Vi 



(8.20) 



and (8.11). With (8.17), a linear stability analysis leads to 



-6P°(i, t) = v l aa p° a (i) £ E% 6P2U, t) = E M ij sp 2U> 0 • ( 8 - 21 ) 



3 3 

: l iP a 



Since the matrix E_ a is antisymmetrical, E ,a as well as the matrix 

M a = (m$) := (i/J a P a °(t)E§) (8.22) 



possess only imaginary eigenvalues (cf. Ref. [182]). As a consequence, 
6P°(i, t) and P a (i,t) show an oscillatory behaviour. Owing to the invariant 
of motion 



P a (i, t) Pa ^ = const. 



(8.23) 



(cf. Ref. [182] and Sect. 10.3.4) and the normalization condition P a (i, t) = 1, 
the trajectory (P a (l, £), . . . , P a (5, £)) tr moves on a (5— 2)- dimensional hyper- 
surface. 



In order to illustrate this, let us consider the particular case with 5 = 3 opinions 
and 



(n a m) 



0 o 1 

1 o o 
o 1 o 



i.e. 




0 -1 1 

1 0 -1 

-1 1 0 



(8.24) 



Then the stationary solutions P l a (i) (/ € {1, . . . , 5}) are unstable. P®(i) = 1/5 
is a further stationary solution with purely imaginary eigenvalues 




(8.25) 
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Equation (8.15) has the special form 

(8.26a) 

with i = i mod S (cf. p. 296) where 

(8.26b) 

can be interpreted as a time- dependent growth rate. 

• The proportion P a (i,t) of opinion i grows as long as the proportion 
P a (i — 1,0 of opinion ( i — 1) is larger than the proportion P a (i + 1 ,£) 
of opinion (i + 1), i.e. A a (i,0 > 0. 

• The increase of the proportion P a (i,t) of opinion i induces an increase 
of the proportion P a (i + 1,0 of opinion (i + 1) by A a (z + i,o > 0 and a 
decrease of the proportion P a (i— 1, 0 of opinion (i— 1) by A a (i— 1, t) < 0. 

• As soon as the proportion P a (i + 1,0 of opinion ( i + 1) exceeds the 
proportion P a (i — 1,0 °f opinion (i — 1) a decrease of the proportion 
P a (*,0 °f opinion i begins due to A a (i, 0 < 0. 

• These phases return again and again because of the cyclical order of 
the S opinions (because of i = i mod S) (cf. Fig. 8.7). 

The above situation can occur if the readiness R a (i — l|i) to exchange the 
present opinion i for the previously held opinion ( i — 1) is smaller than the 
readiness R a (i + l|i) to exchange the present for a new opinion (i + 1). After 
a number of opinion changes one possibly ends up in the original opinion and 
the cycle starts from the very beginning: So to say, one goes round in circles. 
Such cases are known from fashion or economy. 

Due to (8.23) and (8.11), for S = 3 the trajectory (P a (l+), . . . , P a (S , t )) moves 
on a one-dimensional ‘hypersurface’ (in a cyclical way). For its shape, from 
(8.11) and (8.23), we obtain the relations 

ft ( 2,0 = 

and 

ft (3, 0 = 1 - ft (1, 0 - ft (2, t) (8.27b) 






with 



C a P a (l, *o)ft(2, *o)ft(3, to ) . 



(8.27c) 
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t 

Figure 8.7: Oscillations for persuasion processes between 3 different opinions. 
The different opinions win over the majority of the individuals in an alternating 
but determined order. 



The corresponding cycles are depicted in Fig. 8.8 for various initial conditions. 

An examination of equation (8.26) shows that for S > 3 opinions the temporal 
evolution becomes more and more complex with increasing number S of diffe- 
rent opinions (cf. Figs. 8.9 to 8.11). In particular, under certain circumstances, 
the order in which the different opinions gain the majority of individuals varies. 
Over short periods of time the course of Pi(i,J) appears not necessarily peri- 
odic any more (cf. Fig. 8.9). However, a consideration of the phase portrait 
indicates that over long periods the temporal course is still periodic (or at least 
quasi-periodic) since the trajectory closes itself after a certain period of time 
(at least almost) (cf. Figs. 8.10 and 8.11). 



8.3.3 Influence of the Interaction Frequencies 

In the following we will examine the influence of the interaction frequen- 
cies fab(t) of kind k (cf. (7.57)) and of the related interaction rates 
u ab(t) :=z ^(O/a&W respectively. The interaction frequency /**,(£) of an indivi- 
dual of subpopulation a with individuals of subpopulation b certainly depends 
on the liking l a b(t) which individuals of subpopulation a have for individuals 
of subpopulation 6. 
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P t (1.t) 

Figure 8.8: Phase portraits of oscillations between 3 different opinions for var- 
ious initial conditions. 





Figure 8.9: Oscillations for persuation processes between 5 opinions: Over a 
short time interval the course of Pi(i, t) is not periodic. The different opinions 
win over the majority of individuals in a varying order. 
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Figure 8.10: Phase portrait of oscillations for persuasion processes between 5 
opinions: The trajectory moves on a torus. Over a long time the trajectory 
closes (at least almost) so that the long-term behaviour of is periodic 

(or at least quasi-periodic). 



0. 40 



0.30 



£> o.ao 
oT 

o. 10 



o.oo * » ■ - ■ - l 

0.00 0,10 0.20 0.30 0.10 

Figure 8.11: Phase portrait of oscillations for persuasion processes between 6 
opinions: The long-term behaviour is periodic since the trajectory forms a close 




curve. 
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For example, with 0 < l a b(t) < 1 and the approach 

(8.28a) 

(8.28b) 
(8.28c) 

we could model the fact that the frequency of imitative processes and compro- 
mising processes increases with the liking whereas the frequency of avoidance 
processes lessens when the liking grows. Here (t) is a measure for the over- 
all frequency of imitative processes within subpopulation a, fa(t) denotes 
the overall frequency of avoidance processes, and (t) that of compromising 
processes. 

Figures 8.12 to 8.14 illustrate the time- dependent solution of equations (8.13) 
for two subpopulations (a = 1: — , a — 2: — ), 

/0 0 l\ 

(R a (i\j;t)) := 1 0 0 , (8.29) 

V 7 \0 1 0 / 

and various interaction rates v\ h {i) = 0 = 

In the case 

«d(0=("i»(0 ):=(j ^5) (8-30) 

we find uncoupled oscillations in both subpopulations with different frequencies 
(cf. Fig. 8.12). 

If we have 

^(0 = (^)):=(J Q 1 ,) , (8-31) 

the behaviour of subpopulation 1 ( — ) will depend on the behaviour of sub- 
population 2 ( — ): The oscillation frequency of subpopulation 1 adapts itself 
to the one of subpopulation 2 (cf. Fig. 8.13). This phenomenon is called fre- 
quency locking. Here, the individuals of subpopulation 2 can, in a certain 
way, be understood as trendsetters or opinion leaders. 

Finally, for 

(832) 

both subpopulations influence each other mutually: The oscillations damp each 
other (cf. Fig. 8.14). 
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Figure 8.12a: Uncoupled oscillations of different frequencies for persuasion pro- 
cesses between 3 opinions in two subpopulations. 
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Figure 8.12b: Phase portrait of uncoupled oscillations: After some time the tra- 
jectory runs parallel to its previous segment which indicates a quasi- periodic 
temporal evolution. 
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t 



Figure 8.13a: One-sidedly coupled oscillations for persuasion processes between 
3 opinions in two subpopulations are connected with an adaption of the fre- 
quency (frequency locking). 




P t 0.t) 



Figure 8.13b: Phase portrait of one-sidedly coupled oscillations displaying a 
limit cycle as a consequence of frequency locking. 
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t 

Figure 8.14a: Mutually coupled osciallations for persuasion processes between 
3 opinions in two subpopulations result in a damping of the oscillations. 







Figure 8.14b: Phase portrait of mutually coupled oscillations showing a spiral 
motion towards a fixed point. 
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8.3,4 Period Doubling Scenarios and Chaos 



According to naive everyday experience opinion formation processes can also go 
off in a chaotic way (if one thinks, for example, of turbulent arguments regar- 
ding controversial affairs). In this section it will turn out that the Boltzmann 
equations (8.10) for opinion formation can, indeed, display a chaotic temporal 
evolution. Although the concrete interaction processes and parameter values 
that are assumed in the following are not empirically founded, for illustrative 
reasons we will choose an interpretation in terms of opinion formation proces- 
ses. A number of other Boltzmann equations showing chaos can be found in 
References [258,1,87]. 



Let us consider the case of one single subpopulation (A = 1) and direct pair 
interactions between 5 = 9 opinions given by the following interaction schemes 
(cf. (7.46)): 



1.9 4,9 

4.9 S- 1,9 

2.9 1,9 

1.6 4,6 

3.3 1,1 

1.9 £- 3,9 

1.3 £- 2,3 

6.6 8,6 

7.7 6,7 

8.8 £- 7,8 

4,5 4,4 

4.4 S- 5,4 



r 


(8.33a) 


}( 2 ) 


(8.33b) 


}( 3 ) 

> 


(8.33c) 


►( 4 ) 

J 


(8.33d) 


>(5) 

> 


(8.33e) 


1 ( 6 ) 


(8.33f) 



Here the rates ki represent the interaction rates i^i', j'|i, j) of the interactions 



* 



h3 



(8.34) 



A social interpretation of the above interactions can be given on the basis of 
classification scheme (7.46): 

The interactions summarized by (1) obviously describe a compromising beha- 
viour where opinion 9 catalyses a change between opinions 1 and 4 without 
itself being subject to a change. Opinion 9 also catalyses a transition from 
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opinion 1 to opinion 2 which is delineated by interaction (2). Besides, since 
opinion 9 itself is not subject to any opinion changes, it symbolizes a mature 
and mediating opinion. Opinion 6 has also a mediating effect, namely on the 
transition from opinion 4 to opinion 1 which is given by interactions (3). 

Interactions (4) reflect an interaction chain where in the first interaction 
process a mutual avoidance behaviour in favour of opinion 3 occurs if opinion 
1 meets opinion 1. Opinion 3, however, is replaced by opinion 1 as soon as it is 
confronted with the mediating opinion 9. During the time until this happens, 
however, opinion 3 catalyses transitions from opinion 2 to opinion 1. Opinion 2 
as well as opinion 1 are characterized by a particular readiness for compromises. 

By (5) a cyclic interaction chain of persuasion processes is described where 
opinion 6 mediates the change of opinion 8 to opinion 6, opinion 7 the change 
of opinion 6 to opinion 7, and opinion 8 the change of opinion 7 to opinion 8 
(cf. Sect. 8.3.2). 

Finally, by (6) an avoidance behaviour is delineated where opinion 4 is exchan- 
ged for opinion 5 if confronted with opinion 4. With a certain rate, however, 
also the reverse process occurs where opinion 5 is persuaded of opinion 4. Thus 
opinion 4 is characterized by some degree of undecidedness. 

The temporal evolution of the proportions P(i, f) of the 9 different opinions is 
reflected by the following Boltzmann equations: 



^P(M) = k 1 P(4,t)P(9,t)-k' 1 P(i,t)P(9,t)-k 2 P(i,t)P(9,t) 
+ Jfc 3 P(4, t)P( 6, t ) - 2* 4 [P(1, <)] 2 + * 4 P(3, t)P( 9, t) 





+ 


k"P(2,t)P(Z,t) , 


(8.35a) 


7, p < 2 '‘> 


= 


k 2 P(l, t)P(9, t) - *"P(2, <)P(3, t) , 


(8.35b) 


^(3,0 




2fc 4 [P(l, <)] 2 — k' 4 P(3,t)P(9, t) , 


(8.35c) 


S p < 4 '‘> 


= 


— fc 1 P(4, <)P(9, t) + *1P(1, f)P(9, t) - 


k 3 P(4,t)P(6,t) 




- 


k 6 [P(i,t)] 2 + k' 6 P(4,t)P(5,t), 


(8.35d) 


^<5,0 


= 


k 6 [P(4,t)) 2 -k’ 6 P(4,t)P(h,t), 


(8.35e) 


S P(6 ' ( > 


= 


k 5 P(6,t)P(S,t) -k' 5 P(6, t)P(7, t), 


(8.35f) 


S P(7 ' ( > 


= 


k' 5 P(Q,t)P(7,t)-k^P(7,t)P(S,t), 


(8.35g) 




= 


k^P(7,t)P(8,t)~ k 5 P(6,t)P(8,t) , 


(8.35h) 
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(8.35i) 



We immediately find P(9,t) = F(9, 0). By means of the transformation 

a if i e {1,2, 4, 6, 7, 8} , 

P(i, t ) =: j/, (t)P(9, 0) x < /? if i = 3 , 

7 if i = 5 



with 



and 





(8.38a) 

(8.38b) 

(8.38c) 



the Boltzmann equations (8.35) go over into the following scaled equations: 



^yi(r) = ^y4(T)-(5 + l)j/i(r) + /cy 4 (r)j/ 6 (T) 



- f{[yiW] 2 -y3(T)} + y2(T)y 3 (r), 


(8.39a) 


^ 2 / 2 (t) = Byi(r) - y 2 {r)y z {T) , 


(8.39b) 


^!fe(r) = e{bi(T)] 2 -y 3 (r)}, 


(8.39c) 


~~j^y*( r ) - -^y4(T) + j/i(r)-Kj/ 4 (r)y 6 (T) 




- Dy 4 (r) [j/ 4 (r) - j/ 5 (r)] , 


(8.39d) 


= -Dy4(T)[y 4 (r)-y 5 (r)] , 


(8.39e) 


^ye(T) = y 6 (r) [Cj/ 8 (t) - C'y 7 (r)] , 


(8.39f) 


^yr(r) = y 7 (r) [c'j/ 6 (t) - C"ya( r )] , 


(8.39g) 


^ys(-r) = t/s(r) [c'VM - Cy 8 (t)| . 


(8.39h) 
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Here we have 



A 


:= h/k [ , 


B := k 2 /k[, 




C 


:= ak$/k [ , 


C' := ak' 5 /k[, 


C" := atj'/ti 


D 


:= ake/k[ , 


E := k 6 /k' 6 , 


k := aks/k'i 


e 


:= 2ak^/k\ , 


<F 

II 





As a particular example let us consider the case 



A := 0, B := 1.2, 

C := 0.46, C' := 0.46, C" := 0.46, 

D := 100, E := 10000, k = variable, 

e := 0.01, e' := 0.0001. 



(8.40) 



(8.41) 



For 


:= 1 and 


a := 1 


(8.42) 


this implies 


ki 


= o, 


(8.43a) 


k 5 , £5, fcg 


~ 1 , 


(8.43b) 




~ 0 . 01 , 


(8.43c) 




~ 100 , 


(8.43d) 


*4 


~ 10000 . 


(8.43e) 


That is, the interactions 


3,3 A 


U, 


(8.44a) 


4,4 A 


5,4 


(8.44b) 


are slow, the interactions 


1,9 A 


3,9, 


(8.45a) 


4,5 A 


4,4 


(8.45b) 


are fast, and the interaction 


1,3 A 


2,3 


(8.46) 


is rapid compared to the remaining interactions. Therefore, 


the interaction 


chain (4) can be formally contracted to 


1,1,1 «— 


2 , 1,1 


(8.47) 
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which is equivalent to an interaction between three individuals. With 

! h(t) ~ 1, (8.48a) 

p ~ 0.0001, (8.48b) 

7 - 10000 (8.48c) 

this implies that proportion P(3,t) is very small and proportion P(5,t) very 

huge compared to the remaining proportions P(i>t) ( i & {3,5}). As a conse- 
quence, opinion 3 represents only a temporary stage which is left immedia- 
tely. Opinion 5 represents the opinion of the large majority. 

The computer simulations of equations (8.39) with parameters (8.41) and initial 
conditions 



Vi (0) := 0.6, 


!/2(0) := 1.8, 


Vs(0) := 0.36, 


y 4 ( 0 ) := 1 , 


r-H 

II 

0 s 

s 


MO) ■■= 112, 


2 / 7 ( 0 ) := 1 + 0.12 sin (27r/3) , 


y 8 (0) := 1 + 0.12 sin (4ir/3) 



show several period doubling scenarios to chaos along with a variation 
of the control parameter k € [0,0.55]. In Figure 8.17 the temporal evolution 
of yi(r) = yi(«,r) is depicted for various values of the control parameter 
k. In contrast, Figures 8.18 to 8.23 show, apart from the power spectra 
f(cj) = /(/c,u>) of the functions yi(r) = yi(/c,r), the yi-t /2 projections of the 
trajectories (the phase portraits) after they converged to their attractors. (For 
a detailed discussion of the interpretation of these figures cf. Sect. 5.5.2). 

Chaotic behaviour is characterized by the irregularity of the phase por- 
traits and power spectra (cf. e.g. Fig. 8.21). If there are distinct maxima in 
chaotic power spectra this indicates intermittency, i.e. random chaotic bursts 
in an otherwise nearly periodic temporal evolution (cf. Fig. 8.24). 

Figures 8.18 to 8.21 on the one hand and Figures 8.22, 8.23 on the other 
hand indicate the existence of several period doubling scenarios. This becomes 
particularly clear in Figure 8.15 which depicts the largest Liapunov exponents 
A,(/c). The critical points Kj where period doubling occurs are characterized 
by the fact that the largest Liapunov exponent grows up to zero for decreasing 
values of k > Kj and falls down again for k < Kj. Chaotic behaviour is observed 
if the largest Liapunov exponent is greater than zero. 

The above mentioned period doubling scenarios can be related to a simpler sy- 
stem due to the circumstance that the parameters (8.41) occuring in the scaled 
equations (8.39) possess completely different magnitudes. As a consequence, we 
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have 



{[!/i( r )] 2 - 2/3 (t)} = 7^2/3(t) « 0, 


(8.50a) 


1 / 4 ( 1 -) [jfe(r) - j/ 4 (r)] « i-^ 2 / 4 (r) « 0 , 


(8.50b) 


^ 2 / 5 ( 1 -) = £ 2 / 4 ( 1 -) [ 2 / 4 ( 1 -) - 2 / 5 ( 1 -)] « 0 , 


(8.50c) 


2 / 3 ( 1 -) « [j/i(r)] 2 , 


(8.51a) 


2/4(t) « 2 / 5 ( 1 -) , 


(8.51b) 


2 / 5 ( 1 -) « 2 / 5 ( 0 ) . 


(8.51c) 




K 

Figure 8.15: The largest Liapunov exponents for the considered Boltz- 
mann equations: Critical points where period doubling occurs via a pitch- 
fork bifurcation are characterized by the fact that the largest Liapunov 
exponent just reaches zero. If it exceeds zero, the temporal behaviour of the 
system is chaotic. 
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Summarizing this, 2 / 3 (r) can be adiabatically eliminated. In contrast, 2 / 4 (r) 
and 2 / 5 (r) are quasi- constant. Thus the equations for 2/1 (r) and 2 / 2 (t) simplify 
to 



^yi(r) = A — (B + l)yi(r) + [j/i(t)] 2 j/ 2 (^) + Kye(r) , (8.52a) 

^ 3 / 2 ( 1 -) = 5j/i(r) - [j/i(t)] 2 j/ 2 (-t) (8.52b) 

where ye(r) displays non-linear oscillations (cf. Sect. 8.3.2). The relation of 
these equations with those of the sinusoidally forced Brusselator (cf. Refs. 
[112,113,258]) is obvious. 

Figure 8.16 shows the largest Liapunov exponents for the reduced system 
of equations (8.52), (8.39f-h). It reveals a strong agreement with Figure 8.15 
though chaotic systems usually react in a very sensitive way on small variations. 
Consequently, there are also areas where considerable deviations are found, e.g. 
for 0.325 < k < 0.35. 




K 



Figure 8.16: For the reduced system (8.52), (8.39f-h) of the scaled Boltzmann 
equations the largest Liapunov exponents show, in dependence of k, a similar 
course as those of the exact equation system (8.39) depicted in Figure 8.15. 
The separation of time scales, i.e. the adiabatic elimination and the 
approximation of quasi- Constance for some variables seems, therefore, to be 
justified for the most part — even in the chaotic regime. 
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(M)U 

Figure 8.17: Temporal evolution of the scaled coordinate yi(r) = t/i(/c, r) after 
the transient time for various values of the control parameter k (/c = 0.55: 
Period 1; k = 0.495: Period 2; k = 0.4: Period 4; k = 0.389: Period 8). The 
period reflects after how many oscillations the temporal behaviour recurs. 




188 



8. Opinion Formation Models 




§° 



Figure 8.18: Phase portraits (below) and corresponding power spectra f(u) 
(above) for various values of the control parameter /c (/c = 0.55: Period 1; 
k = 0.495: Period 2). In the phase portrait a period doubling manifests itself 
by an additional loop (or indentation), in the power spectrum by additional 
maxima in the middle of those maxima which were found before the period 
doubling. 





U.00 




Figure 8.19: Phase portraits (below) and related power spectra f(u>) (above) 
for various values of the control parameter k (k = 0.4: Period 4; k = 0.389: 
Period 8). 






0,3876 




Figure 8.20: Phase portraits (below) and corresponding power spectra /( u) 
(above) for various values of the control parameter k (k = 0.3875: Period 16; 
k = 0.385: Chaos (intermittency of period 4)). 









Figure 8.21: Phase portraits (below) and related power spectra /( lj ) (above) 
for various values of the control parameter k (k = 0.38: Chaos (intermittency of 
period 2); /c = 0.365: Chaos (intermittency of period 1)). The phase portraits 
have a fractal dimension (cf. Sect. 5.5) and are called strange attractors. 






0.3600 I L k - 0.3600 
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{x)H 






Figure 8.22: Phase portraits (below) and corresponding power spectra /( u>) 
(above) for various values of the control parameter k (k = 0.36: Period 3; 
k = 0.342: Period 6). 






ootro - * II 




Figure 8.23: Phase portraits (below) and related power spectra /(a;) (above) 
for various values of the control parameter /c (/c = 0.34: Period 12; k = 0.32: 
Chaos). 
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Figure 8.24: Chaotic temporal evolution of the scaled co-ordinate yi(r) after 
the transient time for various values of the control parameter k (k = 0.385: 
Intermittency of period 4; /c = 0.38: Intermittency of period 2; k = 0.365: 
Intermittency of period 1; k = 0.32: Chaos). Intermittency is characterized by 
nearly periodic phases (so-called intermissions) that are randomly interrupted 
by irregular bursts. 
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8.4 Generalizations 



The approach of the previous sections can be generalized in various aspects: 



• The results in Sections 8.2 and 8.3 obviously hold only for local or spatially 
homogeneous cases of opinion formation. If, apart from opinion changes, 
the effects of location changes are to be considered, we must represent by 
the state x (or y) not only the opinion i (or j) but also the location r 
(or s) of an individual: 

*:=(*»> V •=(],»)• (8-53) 

• The restriction to opinion changes can be dropped because the previous 
discussion can be directly extended to more general behavioural changes. 

• If we want to model pair interactions and indirect interactions together , 
we must use the configurational master equation with the transition rates 
that were introduced in Section 3.5. The resulting approximate mean va- 
lue equations (3.16) are BoLTZMANN-like equations of the form (7.54) 
in which, however, the transition rates 

w a {x'\x\t) = w a {x , \x\ (n)\t) (8.54a) 

t o a b(x',y'\x,y;t) = w ab (x',y'\x,y;{n);t) (8.54b) 

depend on the mean value (n). Here transition rates (8.54b) describe pair 
interactions that implicitly depend on indirect interactions. Examples for 
the combination of spontaneous transitions and indirect pair interac- 
tions will be discussed in connection with the game dynamical equations 
(cf. Sect. 10.4). 

• Up to now, due to the Markov assumption (1.17) and (1.91) respec- 
tively, we neglected memory effects. If necessary, however, these can be 
taken into account by the generalized BoLTZMANN-like equations 
(cf. Ref. [47]) which are derivable from the generalized master equation 
(1.6) in a similar way as in Chap. 2 (or Chap. 3). They read 



dt 



P a {x,t) 



I dt 1 ^2 \wS_ t ,{x\x' \t')P a (x' ,?) 

to x> 

-w?_ { ,(x'\x-,t')p a (x,t')} 



(8.55a) 
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with the effective transition rates 



(8.55b) 



The memory effects go back partly to short-term memory and partly 
to long-term memory. For the effects of short-term memory the Mar- 
kov assumption will normally be justified. The long-term memory shows 
up in the increase of knowledge and experience as well as in the 
behavioural type but it has, in most cases, probably only a subordi- 
nate influence on the dynamics of the medium-term social processes we 
are mainly interested in. 



w^_ t ,(x'\x\t') .- w\ ‘'(x'l *;*') 



+ EEE"**' {x',y'\x,y, t')P b (y,t'). 
b y y 1 



8.5 Spatial Spreading of Opinions 

This section is dedicated to approaches for the description of spatial aspects 
of opinion formation which, up to now, were put aside. There are several pro- 
cesses which can lead to a spatial dissemination of opinions — on the one hand 
diffusion processes and migration processes, on the other hand telecom- 
munication processes. Of course, both kinds of processes might also occur 
simultaneously. 

8.5.1 Opinion Spreading by Diffusion 

In the case of opinion spreading by diffusion we will suppose that opinions 
disseminate spatially by random location changes of the individuals. With 



(8.56a) 



and 



«>«i(*', s'|i, r,j, s ; *) := w r ai (i', j'\i, j;t)6(r' - r)6(s' - s)S(s - r ) 

(8.56b) 

we assume that location changes and opinion changes occur independently of 
each other. Moreover, the location changes happen spontaneously whereas the 
opinion changes stem from pair interactions at location r = s. The approach 



w a ,i(r'\r;t) = v r a>i {t)p a , ,(r'|r;<) 



w a {i' ,r'\i,T\t) := w ai i(r'\r;t)6u> 



(8.57a) 
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with 

(8.57b) 

guarantees that location changes arise only between neighbouring locations with 
a probability that decreases with growing distance ||r / — r|| from the destination 
r'. Equation (7.54) for the spreading of opinions then reads 




b s' j j' 



sw-;*) ■ EEEE i; t)Pa{i', r, t)Pb(j,r\ t ) 



- w r ab (i',j'\i,j] t)P a (i, r; t)P b (j, r; f)] 

+ J d 2 r' [ w aii (r\r';t)P a (i,r';t ) - u> a ,;(r'|r;t)P a (i,r;t)] • 



(8.58) 



However, if ,(t) is independent of r , one often prefers the representation 







-Kbii'J'lh 



*, i'\i\ j]t)Pa(i',r ; t)P b (j , r; t) 
,j;t)Pa(i,r-,t)P b (j,r;tj\ 



+ D ai (t)A r P a (i,r-,t) 



(8.59a) 

in which A r denotes the Laplace operator. Equation (8.59a) results from 
equation (8.58) by carrying out a second order Taylor approximation (cf. 
Chap. 4). Especially, there is 



D a ,i{t) = J d 2 r' {r' -rfw a<i {r'\T\t) 



— J dip J dAr 1 (Ar 1 ) 

o o 

= 6 l / a ,«(<)( fi a ,«') 2 



A3_iV(0_ -Ar'/fi a ,, 

’ 2*(Ra,i) 2 



with Ar' := ||r' - r||. Equation (8.59) is known from chemistry as the 
reaction- diffusion equation and can produce spatial waves or spirals (cf. 
Ref. [153]). Puu has suggested spatial oscillations also in connection with simi- 
lar models of opinion spreading [227]. However, the dissemination of opinions 
by diffusion applies only to a few situations, e.g. to crowded halls. 

Another situation for which a slow diffusion by migration of individuals is 
assumed was examined by Schweitzer et al. [257]. They subdivided the space 
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of the system into discrete boxes and chose for the migration rates the 
approach 

(8.60) 

According to this, the migration rate w a) i(r f \r;t) is proportional to the mobi- 
lity i/ a ,t(0 which depends on the opinion i E {0, ±1, ±2, . . .} of the individuals. 
For c a > 0 the migration effect is repulsive and the individuals distribute all 
over the system. Whereas individuals with a neutral position ( i = 0) tend to an 
equipartition due to w at o(r'\r;t) = v a , o(0> individuals with an extreme po- 
sition (with large |i|) are inclined towards concentrating in small spatial areas, 
i.e. towards forming an ‘autonomous scene’. For large |i|, the settledness 
l/ u; a > i(r , |r; t) is large only if the concentration nf r /N a of individuals of the 
same opinion is high. 




8.5.2 Opinion Spreading by Telecommunication 

In cases 



• where individuals, on the one hand, change their place but, on the other 
hand, always return to the same place r (their flat) or 

• where individuals stay at a fixed place r but have an opinion exchange 
by telephone (or correspondence ) with individuals at other places 



we must apply another model. Since in such cases the place r can be treated 
as a quasi-constant, characteristic quantity of an individual, it formally plays 
a similar role as the behavioural type a. This implies 

(8.61a) 



(8.61b) 

For the distance dependence of the interactions we could, for example, as- 
sume 

(8.61c) 

because exchanges of views will become less frequent with growing distance 
|| s — r||. If we define 





Pa(i, r; t) =: P a (r)P?(i, t) 



(8.62a) 
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N r 

W := Tf 



(8.62b) 



Ewo = ! 



(8.62c) 



and equation (7.54) for the spatial spreading of opinions takes on the form 



P r a (M) = 



+ EEEEE 

s b i' j j 1 

lb i; (f, W(a)P!(j, <)1 • 



(8.63) 

Since N* denotes the number of individuals of subpopulation a who are at 
place r we obtain 



E N «=Na and E P »( r ) = 1 ' 



(8.64) 



Consequently, P a ( r) hats the meaning of the population distribution of sub- 
population a over the places r. P*(i,t) is the proportion of individuals of 
subpopulation a at place r with opinion i and thus it describes the proba- 
bility with which an accidentally selected individual of subpopulation a at 
location r will hold opinion i. 

The simplest case of the spatial spreading of opinions is the dissemination of 
news or rumours. Then we have only to distinguish the ignorance ( i = 1) and 
the knowledge (i = 2) of the concerned news (i.e. i € {1,2}). The spreading of 
news bases on pair interactions of the kind 

2,2 4 — 2,1, (8.65a) 

2,2 4 — 1,2, (8.65b) 

i.e. on ‘persuasion processes’. Processes of forgetting the news are modelled 

by spontaneous transitions: 

1 4 — 2 . (8.66) 

Other processes will be neglected. If we, in addition, restrict ourselves to one 
single subpopulation (^4 = 1), the equation (8.63) for the spreading of news 
takes on the simple form 



-P r (2, t) = £ w rs { 2, 2|1, 2; t)P(s) 1 - P r ( 2, t)\ P s ( 2, t) - w r ( 1|2; t)P r ( 2, t) 



(8.67) 
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where we took into account P r (l,£) = 1 — P r (2,£) and dropped the subs- 
cript/superscript a. In the limit R—+ 0 (cf. (8.61c)), (8.67) has the form of the 
logistic equation (cf. (7.65)) and describes a local spreading of news. 

For the population distribution P(s) the homogeneity assumption P(s) = 
const, is normally not justified. Due to the population concentration in towns 
P(s) varies considerably. An idealized scheme of the spatial population dis- 
tribution is given by Christaller’s system of central places which is 
illustrated in Figure 8.25. Already equation (8.67) is no longer solvable analy- 
tically and numerically only with the effort of much computer time. Due to the 
spatial self- similarity of the system of central places, however, the methods 
from renormalization theory possibly allow the determination of approxi- 
mate solutions for the spatial spreading of opinions (cf. Refs. [117,136, 143]). 



Figure 8.25: Schematic 
representation of the pop- 
ulation distribution ac- 
cording to Christal- 
ler’s system of cent- 
ral places (cf. Ref. [39]): 
By means of circles of 
varying size towns with 
different numbers of in- 
habitants are represen- 
ted. The population dis- 
tribution has approxim- 
ately a self-similar hexa- 
gonal structure which, 
however, in reality is dis- 
torted by topography. 




o 

• * 

— i — -Q i 

/ *r \ • 

a ^ «D 

\ • • / • 

• \ 1 / • 

# T 

• • • 

o 4) o 

• • 

o 




Finally, let us stress that equation (8.63) can, due to 



P t (s)w^(i',j%j;t) = N r a w?,(i',j%j;t) 



( 8 . 68 ) 



a := (a, r), b :=(b,s), 



(8.69) 



with the abbreviations 
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also be written in the familiar form 
d 



— P a {i,t) = ^2 [w a (i\i';t)Pa(i',t) - tt> a (»'|*; O^MM)] 

V 

6 i' 3 3' 



- , j ; t)P a {i, t)p b (j , *)] 


(8.70) 


= i). Here there are 


£ : = £• 


(8.71a) 


b b,S 


P a (i,t) := P?{i,t), 


(8.71b) 


w a (i'\i-,t) := w°{i'\i\t) , 


(8.71c) 




(8.71d) 



In contrast to the right-hand side, the subscripts on the left-hand side of (8.71) 
have the meaning that was defined in (8.69). The possibility of representation 
(8.70) bases on the circumstance that, in this section, location r is treated as 
a characteristic, quasi-constant quantity of an individual, i.e. in the same way 
as the behavioural type a. 




9 Social Fields and Social 
Forces 



9.1 Introduction 



From physics we know that nothing changes if there is no force at work. Con- 
versely, the effect of a force is characterized by a change of the state of a system. 
Similar relations hold in connection with human behaviour. An individual will 
not change his/her behaviour if he/she is satisfied with his/her situation, i.e. if 
there is no reason for a change like a (physical or psychological) need, social 
pressure, etc. Therefore, we will try to describe the reasons for behavioural 
changes by social forces. 

A related approach was already suggested by Lewin [169] who developed the 
‘social field theory’ according to which the behaviour of an individual is 
determinded by a ‘ social field’. This reflects the influences of the physical 
and social environment, e.g. of public opinion, social norms, and trends. 
Although Lewin formalized his ideas, a fully mathematical formulation has 
long been missing. Such a formulation will be presented in this chapter in terms 
of differential equations. Since the behaviour of an individual is not completely 
determined by the environment (cf. Sects. 6.2.3 and 6.2.4) these equations 
must be stochastic in order to take into account the individual behavioural 
variations. 

The behaviour of individuals in a social field shows an analogy to the behaviour 
of electrons in an electric field (e.g. of a nucleus). In particular, individuals as 
well as electrons, on the one hand, influence the effective (social or electric) 
field which, on the other hand, affects their behaviour (cf. Sect. 4.6). However, 
when developing a model for the influence of a social field on behaviour, we must 
not be tempted by this analogy to apply the Schrodinger equation [250] 
which is not suitable for describing social processes. Instead, we have to start 
from the BoLTZMANN-like equations which were obtained for behavioural 
changes in Chapter 7. A reformulation of these equations by means of a second 
order Taylor approximation leads to special diffusion equations. Because of 
their relation to the Boltzmann equation and the Fokker-Planck equa- 
tion they are called ‘Boltzmann-Fokker-Planck equations’ (cf. Sect. 
4.6). These can be transformed into mathematically equivalent Langevin 
equations (cf. Chap. 5). 

According to the Boltzmann-Fokker-Planck equations and the Langevin 
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equations the systematic and the most probable behavioural change respectively 
are given by vectorial quantities K a (x y t) which can be interpreted as social 
forces. They describe concrete motivations to act which are evoked in an 
individual as a reaction to his/her environment and are the drive behind be- 
havioural changes. Individual behavioural variations due to the ‘freedom of 
decision-making 9 are taken into account in the equations by diffusion co- 
efficients. 

In simple cases social forces can be expressed by potentials V a (x,t) which 
possess the above described characteristics of social fields. A social field can be 
imagined as a range of mountains in behavioural space the gradient (steepness) 
of which is a measure for the social force which acts at a certain location of 
behavioural space, i.e. on a certain behaviour. 

It turns out that social forces K a {x ) t) are noticeable only for those individuals 
whose behaviours x deviate from minima (‘valleys’) of the social field (from 
the ‘norm’). However, a behaviour which in subpopulation a is exposed to a 
social force can in another subpopulation b correspond to the norm. 

The social force model has the advantage that it provides a very intuitive 
description of social processes. It is suitable for the delineation of opinion for- 
mation, purchase pattern [261], group dynamics, or pedestrian motion [124, 
123, 134]. However, due to the Taylor approximation, it is only applicable if 

1. the behavioural changes can be represented in more or less continuous 
behavioural spaces (which may differ from one subpopulation to another), 

2. the behavioural changes mainly take place in small steps. 

In cases where these conditions are not fulfilled we must use the Boltzmann- 
like equations instead. 



9.2 Derivation 



In the following we will assume to be confronted with continuous or at least 
quasi- continuous behavioural spaces y a • The m dimensions of these spaces 
correspond to various characteristic aspects of the considered behaviours. The 
BoLTZMANN-like equations (7.43) then have the form 



d_ 

dt 



P a (x, t) = j (TV [w a (x\x'; t)P a (x', t ) - w a (x' \x; t)P a (x, *)] 

= JcTy w a [y\x-y,t]P a (x-y,t)-w a [y\x;t]P a (x,t)j (9.1a) 
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with the effective transition rates 



w a [x f — x\x] t ] 



w a (x'\x\t) 

w a (x'\x;t) + ^ JdT'y’ JtT n ywab(x' i y'\x,y;t)P i (y,t). 

b 

(9.1b) 



From Chapter 4.6 we know that equations (9.1) can, by means of a second or- 
der Taylor approximation, be transformed into special diffusion equations, 
namely the Boltzmann-Fokker-Planck equations 




with the effective drift coefficients 



K ai (x,t) := j <ryyiW a [y\x\t\ = JcTx' Ax' i w a (x'\x\t) 



and the effective diffusion coefficients 



(9.2a) 



(9.2b) 



Qaij{x,t) := Jd m yy i y j w a [y\x-,t] = j cTx' Ax' i Ax' j w a (x'\x;t) 



(9.2c) 



(Ax(- := x(- — x,-). Whereas the drift coefficients K a i(x,t) reflect the systematic 
motion (‘drift’) of distribution P a (x,t), the diffusion coefficients Q a ij(n,t) 
describe the broadening of distribution P a (x,t) by fluctuations which result 
from individual variations of the behavioural changes (cf. Sect. 4.5.1). 

In the following we will apply the special BoLTZMANN-like equations with the 
effective transition rates 



w a (x / \x] t) := w a {x'\x\ t) + v a (x\ x\ t)R a (x'\x; t) 



(*' ± x ) , (9.3a) 



V a (x',x\t) := 

b 

+ (&( 0 "&( 0 )ft (*,0 + &( 0 ] 



(9.3b) 



which were developed in Chapter 7 for the description of behavioural changes. 
Then the effective drift and diffusion coefficients split up into contributions by 
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spontaneous transitions (k = 0), imitative processes ( k = 1), avoidance 
processes (k = 2), and compromising processes ( k = 3): 



3 



K ai (x,t):=£K£ ( (x,t), 

k= 0 



(9.4a) 



3 



Qaij( x >t) Qaij( x )t) * 

k=0 



Here we introduced the abbreviations 



(9.4b) 



KiM 



K 3 ai (xJ) 



Qaij( X )t) : 

Qaij( x ’t) : 

Qaij( X ’t) : 
Qaij ( x >0 : 



= j d m x' Ax'iWaix'lxit), 


(9.5a) 


- j dT'x' Ax' i R a (x'\x\t)P b (x\t), 


(9.5b) 


- ld rn x'Ax' i R! l {x l \x\t)P b {x,t), 

b J 


(9.5c) 


- j dT'x' Ax'iR a {x'\x\t) 




8 

1 

£ 

1 

X 


(9.5d) 


J cF'x' Ax( Axyiy 0 (x , |x; t) , 


(9.6a) 


^2^ah(t)fl b (t) Jd m x' Ax'Ax'i? a (x , |x; t)P b (x',t) , 


(9.6b) 


Yl l ' ab ^^'>( t ) J d^x' Ax' i Ax' j R a (x , \x-,t)P b (x,t) , 


(9.6c) 


^2vab(t)f2 b (t) J d m x' Ax' i Ax' j R a (x'\x;t) 




X (l - P b (x', t) - P b (x, «)) . 


(9.6d) 



In particular, K^x, t) and Qaij( x *t) delineate behavioural changes which are 
induced by the physical environment. 



The Boltzmann-Fokker-Planck equations (9.1) can, with the method 
introduced in Chapter 5, be transformed into mathematical equivalent 
Langevin equations. They read 



dx Q f j 

dt 



F ai{Xont) "h ^ ^ Ggij{Xg , 
j-l 



(9.7a) 




9.3. The Social Force Model 



207 



with 




(9.7b) 



and 

(9.7c) 



Here x a (t) denotes the time- dependent behaviour of an individual a of subpo- 
pulation a. The vector 



m 

Qaij {& ooO = * ^ ^ Gqik{x a , i)G a jk{&ot) t) 

k = l 



Ca t(*ajO y ^ Gaij (Xg, t)£ a j(t) (9*8) 

i = l 



describes the contribution to the change of behaviour x a (t ) which is caused by 
individual variations (fluctuations) £<*(£) of the behaviour. 



If the diffusion coefficients Q a ij(&a,t) and thus the coefficients G a ij(x a ,t) 
are small quantities (cf. (9.7c)) which is often the case, we have F a i(x a ,t) « 
Kaifacnt) (cf. (9.7b)). Then equation (9.7a) can be written in the form 



^ « K a (x a ,t) 



+ individual fluctuations 



(9.9) 



9.3 The Social Force Model 



For the following reasons it appears to be justified to denote the vectorial 
quantity 

K a (x, t) := ( 'K a i(x , <)-•••. K ai (x, t),..., K am (x , t)') (9.10) 

as the social force which acts on individuals of subpopulation a: 

• The various contributions jfiT*(x, t) to K a (x,t) add like vectors as it must 
be demanded from forces. Especially, oppositely oriented contributions 
with the same absolute value cancel out each other. By the terms 

K 2 a (x,t), K 3 a (x,t ) (9.11) 

which depend on the behavioural distributions Pb(y,t) in other subpopu- 
lations 6, the contributions due to pair interactions among individuals 
are taken into account. 
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• K a (x a ,t) can be regarded as the drive of the temporal change dx a /dt 
of the behaviour a? a (£) of an individual a belonging to subpopulation a. 
Namely, the vectorial quantity K a (x a ,t) describes, according to (9.9) 
and (5.8), the most probable behavioural change and, according to 
(9.2) and Section 4.5.1, the systematic influence on the direction and 
magnitude of the behavioural change of individual a. 

• If K a (x,t) vanishes this corresponds, due to 

VxVa(x,t) = K a (x,t) = 0, (9.12) 

to an extremum of the social field V a (x, t). Then, according to (9.9), the 
behavioural distribution P a (x, t) will be only subject to changes which are 
caused by fluctuations and described by diffusion coefficients Q a ij(x,t). 

• Nevertheless, social forces are certainly no physical forces obeying the 
NEWTONian laws of mechanics [198, 101]. For example, in physics we 
have the law ‘actio = reaciio’ according to which a body that exerts 
a force on another body feels a counterforce of the same strength but 
opposite direction. In the social sciences, however, this is generally not va- 
lid. Moreover, the effect of a social force does not base on an energy and 
momentum exchange via so-called virtual particles but on an infor- 
mation exchange which evokes a behavioural change only indirectly via 
complicated psychological and mental processes (cf. Fig. 9.1). Indi- 
viduals are active systems that produce the state changes (behavioural 
changes) themselves. 

Therefore, social force K a (x,t) has rather the meaning of a concrete 
motivation to carry out a certain behavioural change. It is a reaction 
to the information which an individual obtains by the perception of 
his/her physical and social environment. This reaction is the result of a 
decision process during which from various available behavioural alter- 
natives that one is selected which comes next to the personal goals and 
interests, i.e. which brings about the maximal utility (cf. Sect. 6.2.4). 

• According to Chapter 7, the spontaneous transition rates for social pro- 
cesses often have the form 



U a (x' ,t)-U a (x,t) 
e 

D a (x , ,x\t) 

Let us examine the case where there is no difference between the utilities 
U a (x,t ) of different behaviours x which could evoke a spontaneous beha- 
vioural change ( U a (x ) = t/ a ). Furthermore, let the effort S a (x' , x; t) of a 
behavioural change depend only on its magnitude \\x* — x\\: 



9.13 



w a (x \x;t) := i/ a (t) 



D a (x',x;t) = D a (||x' - *||; t) . 



(9.14) 
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Figure 9.1: A complete model of behavioural changes must contain three levels 
of description — the perceived stimuli, their influence on the psychological 
motivation to act, and the subsequent physical realization. 

According to the concept of the social force model, a sensory stimulus leads 
to a behavioural reaction (response) that depends on the personal goals and 
is chosen from a set of behavioural alternatives with the objective of utility 
maximization. 
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As expected, in this case the contribution to the social force 

j K a (x,t) which is responsible for spontaneous behavioural changes vanis- 
hes. Systematic behavioural changes only occur due to pair interactions 
among individuals, then. By means of an appropriate choice of the state 
space 7 a (with the method of multidimensional scaling [161,301]) re- 
lation (9.14) can always be achieved (at least approximately). 



In simple cases it is possible to express a social force K a (x,t) by a social field 
Vaix^t). If the social force obeys the integrability conditions 





dK ai (x,t) \_dK a j(x,t) . 

dxj dxi % ^ 


a time-dependent potential 




X 

V a (x,t) :=-j dx'-K a (x',t ) 



(9.15) 



(9.16a) 



exists so that the social force K a (x , t) is given by its gradient V*: 



K a (x,t) = -V rV a (x,t) 



i.e. 



K ai (x,t)=:- — V a (x,t) 



(9.16b) 



Clearly spoken, due to (9.16b), we can imagine the social field V a (x,t) as a 
range of mountains in behavioural space, the steepness of which is a measure 
for the social force which acts at a certain location x of the behavioural space, 
i.e. on a certain behaviour x (cf. Fig. 9.2). This range of mountains varies 
dynamically and describes the effects of the physical and social environment 
including public opinion, trends, and norms on the behaviour. 



If the diffusion coefficients are constant under variation of x (Q a ij(x) = Qaij), 
according to (4.29), (4.31) the formal stationary solution of (9.2) is given 
by 



P<t( x ) ~ Pa( x o)e 



4>a{x) 



(9.17) 



with 



(9.18) 



The stationary solution is then determined by the stationary social field V a (x). 
However, since the diffusion coefficients QJx) = ^Qaij(x)^ themselves nor- 
mally vary with x , the relation for the stationary solution is in general con- 
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V a (x,t) 




Figure 9.2: A social field 
V a (x,t) is comparable to 
a range of mountains. 
Here a rather smooth ver- 
sion with one hill and 
two valleys is depicted 
for a one- dimensional be- 
havioural space. The gra- 
dient (slope) y7 x V a (x,t) 
at a certain point x = x 
of the behavioural space 
is a measure for the di- 
rection and magnitude of 
the social force K a (x,t) 
that acts on the corre- 
sponding behaviour. 



siderably more complicated and can possibly not even be described by means 
of potential representation (9.17) (cf. Sect. 4.4.1). Relation (9.18), already, is 
an implicit equation since the stationary drift coefficients K a %{x ) and dif- 
fusion coefficients Q a ij{&) depend on the stationary probability distributions 
P®(y) according to relations (9.1b), (9.2b), and (9.2c). Thus for (9.18) there 
only exists a recursive solution (cf. Sect. 4.6.1). 

In summary, we can state the following about the developed model: 

• Behavioural changes are guided by a social force K a (x, t) which describes 
the most probable individual behavioural change or the systematic influ- 
ence on the direction and magnitude of a behavioural change. In simple 
cases the social force can be expressed by a social field. 

• The individual variations of behaviour are reflected by diffusion coeffi- 
cients Qaij (*> f )• 

• Due to the dependence of the social force on the interactions between 
individuals the social field is not simply given externally. The form of the 
social field is rather affected by the behaviour of individuals. 

• The influence of the event history and of memory effects can be ta- 
ken into account by the generalized Boltzmann-Fokker-Planck 
equations which are derivable from the generalized master equa- 
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tion (1.6) in total analogy to Chapter 4. These differ from the ordinary 
Boltzmann-Fokker-Planck equations only by an additional integra- 
tion over past times V < t and read 




(9.19a) 



with the effective drift coefficients 

:=/<* 

the effective diffusion coefficients 

<?£'(*,<')•= /<*”VA 

and the effective transition rates 

<_<-(*' l*;0 := *;*') 

b 

(9.19d) 



!”V AarJu;®_ t /(* / |«;e / ) 



(9.19b) 



L«J-Aa:jti;®_ t /(a; , |aj;^ / ) 



(9.19c) 



9.3.1 Comparison with Lewin’s ‘Social Field Theory’ 

The above developed model can be regarded as a mathematical formulation of 
Le win’s concept of a ‘social field theory 5 (which certainly is no field theory 
in the sense of physics). 

From social psychology it is known that the behaviour of an individual is given 
by the totality of environmental influences (including social influences) and 
his/her personality. Inspired by the theory of electromagnetic fields, social 
field theory assumes that the environmental influences can be considered as a 
dynamic force field which should be mathematically representable. 

If an individual is in equilibrium with the dynamic force field, a temporal 
change of the field will induce a psychological tension. This finally causes a 
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behavioural change which compensates for the psychological tension and thus 
restores the equilibrium. 

In the following we will show up the connection of the just sketched social field 
theory with the social force model (cf. Tab. II): 



• According to the discussion of Section 9.3, the social field V a (x } t) corre- 
sponds to the dynamic force field which mathematically describes the 
environmental influences as perceived by individuals of subpopulation a. 

• If the behaviour x a of an individual a does not agree with a minimum 
x a of the social field Va(a; a ,<), according to (9.9), the social force 

K a (Xa,t) = -V Xa V a (x a ,t) (9.20) 

will be induced. This is given by the gradient of the social field V a (x a ,t) 
and causes a behavioural change according to 

^*K a (x a ,t) ( 9 . 21 ) 

which pulls the behaviour in direction of the steepest descent of V a (x a ,t). 
Thus the social force (the motivation) K a (x<x> t) plays the role of Lewin’s 
‘psychological tension 9 . 

• The behavioural change dx a /dt drives the behaviour towards a minimum 
x a of the social field V a (x a J). When this minimum is reached (x a = x a ) 
we have 

V Xa V a (x ai t) = 0 (9.22) 

and, therefore, K a (x a} t) = 0. As a consequence, the psychological ten- 
sion K a (x a ,t) disappears, i.e. it was compensated by the behavioural 

change. Then, according to (9.9), only accidental behavioural changes 
take place. The minima x a of the social field V a (x a ,t) thus correspond 
to the behaviours which are in equilibrium with the dynamic force field. 

• The social fields V a (x ay t) of different subpopulations a normally possess 
different minima i a . Consequently, individuals of different behavioural 
types a normally feel different motivations Ka(x a ,t) to act which causes 
different behavioural changes. Therefore, by subpopulations a different 
types of personalities are distinguished. 
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Social Field Theory 


Social Force Model 


Dynamic Force Field 


Social Field 


Psychological Tension 


Social Force 
(Motivation to Act) 


Personality 


Type of Behaviour 
(Subpopulation) 


Equilibrium with the 
Dynamic Force Field 


Minimum of the 
Social Field 



Table II: Terminological interrelations between Lewin’s social field theory and 
the social force model. 

In the special case where an individual prefers a behaviour x a we expect be- 
havioural changes in accordance with the formula 

(9.23) 

at 

This corresponds to the social field 

V a (x a ,t) « y (*„ - x a ) 2 . (9.24) 

Examples for this case are discussed in Reference [124]. 



9.4 Computer Simulations 



The social force model is able to describe a broad spectrum of social phenomena. 
It is, for example, suitable to reproduce pedestrian motion [123, 124, 134], to 
model group dynamics (cf. pp. 13, 16), or to delineate opinion formation. 

In the following we will illustrate a few examples for the application of the 
social force model by computer simulations. For this purpose, let us consider 
the case of two subpopulations ( A = 2) and assume a situation where the 
interesting aspects of behaviour can be characterized by a certain position x 
on a one-dimensional continuous scale (m = 1, x = x). A concrete example 
for this are opinion formation processes. Then, by the two subpopulations, 
we could distinguish individuals of a conservative and a progressive attitude 
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respectively. Position x would represent the grade of agreement or refusal with 
regard to a certain political affair (e.g. SDI, nuclear power stations, or the Gulf 
War). In contrast to Chapter 8 where we distinguished S different opinions i, 
this time we consider a one- dimensional opinion continuum. The various 
positions of this continuum are represented by ‘x’ in order to avoid confusions 
with discrete opinions i. 

For one-dimensional behavioural spaces y a the integrability conditions (9.15) 
are automatically fulfilled so that the social fields 

X 

V a (x, t) ~- J dx' K a {x', t) - c a (t) (9.25) 

X 0 



always exist. The parameters c a (t) can be chosen arbitrarily. We will select the 
value 



c a (t) := min 

X 




(9.26) 



which sets the absolute minimum of V a {x,t ) to zero. 



In the one-dimensional case, according to (4.70), the formal stationary solu- 
tion of the Boltzmann- Fokker- Planck equation (9.2) (which is approached 
in the long-time limit t — ► oo) reads 



P° a (x) = P°(* o) 



Qa(x 0 ) 
Qa(x) 



exp 



2 j dx' 

. Xq 



Kg(x') 

Qa(x') 



(9.27) 



Because of the dependence of K a (x) and Q a (x) on Pj°(y) equations (9.27) are 
only implicit equations. However, from (9.27) we can draw the following 
conclusions: 






If the diffusion coefficients are constant (Q a (%) = Qa )> (9.27) simplifies 
to 



P° a (x) = P° a (x o)exp 



' 2 / , . \ 

~PT~ ( » / a(^) "b 1 

Vo ' ' 



(9.28) 



so that the stationary solution P® (x) is given by the social field V a {x). In 
particular, the behavioural distribution P<?( x ) has its maxima at positions 
x a where the social field V a (x ) has its minima. The diffusion constants Q a 
regulate the broadness of the behavioural distribution P® (x). For Q a — 
0 there were no individual behavioural differences and the distribution 
P°(x) had a sharp maximum at the position of the lowest minimum of 
Va(x). 
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• If the diffusion coefficients Q a (z) vary with position x , the behavioural 
distribution P° (x) will be also influenced by the concrete form of Q a (x). 
According to (9.27), we expect large probabilities P® (ar) at those positi- 
ons x where the diffusion coefficients Q a {x) are small. This is illustrated 
by Figures 9.4 and 9.7 in which the form of the probability distribution 
P\{x)> in contrast to Figure 9.3, cannot be understood merely by the 
social field V\(x). 

• Since the stationary solution P ® (#) depends on V a (x) as well as on Q a (x), 
different combinations of V a (x ) and Q a (x) can lead to the same probabi- 
lity distribution P® (x) (cf. Fig. 9.6 in the long-time limit). 



In the following the simulations will be restricted to the case of imitative and 
avoidance processes. This has the consequence that the form of the social field 
T4(x,f) varies with time t only due to the temporal change of the probability 
distributions P&(t/, £), i.e. due to behavioural changes of the individuals (cf. 
(9.5), (9.16a)). For the positions x we will assume a quasi- continuum x = 
ij 20 G [1/20, 1] with the discretization i G {1, 2, . . .,20}. Furthermore, let the 
transition rates again be of the form (9.3) with 



(9.29a) 



I. := | (9 2%) 



individuals of subpopulation a prefer position x a . 1 /L a represents a measure 
for the resolution of subpopulation a with respect to its preferred position x a . 
The larger L a the larger is the indifference of subpopulation a regarding the 
position held. Finally, let us assume the relation 



R a {x l \x\t) := 



D a (x',x,t) 



(cf. (7.62a)). By means of the choice 

U a (x,t) :=-| ( £ X f£ ) a -C«W 



with 



Vab(t) -|*'-*|/r 

D a (x',x,t) ~ e 



with 




(9.29c) 



for the meeting rates i/ a b(t) and the distance functions D a (x* ,x;t). Then posi- 
tion changes occur the less frequently the larger they are, i.e. the larger \x* — x\ 
is. R can be interpreted as a measure for the ‘range’ of the interaction. 



• If the range R is very small, the diffusion coefficients Q a (a:,f) can be 
neglected. Consequently, the fluctuations play an unimportant role and 
the probability distribution P a (x,f) is mainly given by the social field 
V a {x,t) (cf. Fig. 9.3). 
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• For larger but still small values of R there occur only small position 
changes to neighbouring positions with a rate worth mentioning. In or- 
der to understand the temporal evolution of the probability distribution 
P a (x, t) we must take into account the diffusion coefficients Q a (x,t). then 
(cf. Figs. 9.4 and 9.7). 

• If R exceeds a certain value the second order Taylor approximation be- 
comes useless. In order to be able to comprehend the temporal evolution 
of P a (x,t), apart from K a (x,t) and Q a (£,f), also higher jump mo- 
ments are required (cf. Sect. 4.2). This, however, presupposes a better 
approximation than that of the Boltzmann-Fokker-Planck equati- 
ons. 



In order to avoid mistakes due to the Taylor approximation from the very 
beginning and to be able to use a simple numerical solution method, the calcula- 
tion of the distributions P a (x,£), social fields F a (x,t), and diffusion coefficients 
Q a (x,f) was carried out directly via the BOLTZMANN equations (9.1) with the 
effective transition rates (9.3), (9.29). 

Figures 9.3 to 9.8 depict the results of computer simulations for various chosen 
parameters /* 6 (J), and Here the represented functions are scaled in such 
a way that their maxima reach half of the box height. Again for the interaction 
frequencies f* b (t) of kind k we use, like in Section 8.3.3, the relations 

/i»(0 = /i(0M0 (9.30a) 

and 

/«»(<) = /2(*)(l- MO) (9.30b) 

where l a t(t) denotes the liking of subpopulation a for subpopulation b (0 < 

MO < !)• 



9.4.1 Imitative Processes 

In the following let us consider only imitative processes (/„(*) = 1, /|(t) = 0). 
Let the preferred positions x a be at 

6 , 15 ni\ 

*1 = 20 *'“* := 20 ' < 9 ' 31 > 

On assumption 

(UW)s(/i(<)):=(j [) (9.32) 

individuals of subpopulation a = 1 like those of subpopulation a = 2 but not 
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vice versa. This implies that subpopulation 2 influences subpopulation 1 in a 
one-sided way. 

As expected, for both behavioural distributions P a (£, t) a maximum occurs at 
the preferred position x a . Additionally, a second maximum of Pi(x, t) develops 
around the preferred position X 2 of subpopulation 2 due to the imitative beha- 
viour of subpopulation 1. This second maximum will be small if the indifference 
L\ of subpopulation 1 with respect to changes of the position x is small (cf. 
Fig. 9.3). 

However, for a large indifference L\ of subpopulation 1 with regard to the po- 
sition held even the majority of individuals imitates the behaviour of subpo- 
pulation 2 (cf. Fig. 9.4). One could say that the individuals of subpopulation 2 
act as trendsetters. This phenomenon is typical for the spreading of a new 

fashion. 

Now let us consider the case 

(MO) S (/ a y<)) := ( } }) (9.33) 

where the subpopulations influence each other mutually with the same inten- 
sity. If the indifference L a with respect to changes of the position x is small in 
both subpopulations a, both probability distributions P a (x , t ) form two maxima. 
Whereas the higher maximum is located at the position x a which is preferred 
by subpopulation & itself, the second maximum is around the preferred position 
of the other subpopulation. It is the higher the larger the indifference L a is (cf. 
Fig. 9.5). 

However, a phase transition occurs if L a exceeds a certain critical value at 
least in one subpopulation. Then only one single maximum forms so that we 
observe an entirely different situation. In order to explain this, let us consider 
Figure 9.6. Since the indifference of subpopulation 2 is large, here, its indivi- 
duals move towards the position of subpopulation 1 due to imitative processes. 
Nevertheless, also the majority position within subpopulation 1 shifts some- 
what but, due to the smaller indifference L\, only a little. After some time, the 
subpopulations meet each other at one common position. Thus we could say 
that both subpopulations have agreed on a compromise. This compromise 
(the maximum of the distributions P a (x,J)) is closer to the preferred position 
x\ of subpopulation 1 which shows the smaller indifference L\. 

The remarkable thing about this compromise is that the behavioural distribu- 
tions Pa^x^t) agree after some time though the social fields V a (x,t) and the 
diffusion coefficients Q a (%it) of both subpopulations a differ considerably. The 
reason for this is that the preferred positions x a of both subpopulations a are 
still different. 
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Figure 9.3: Opinion distributions P 0 (x,f), social fields F a (x,t), and diffu- 
sion coefficients Q a (x y t) for one-sided imitative processes of subpopulation 
a = 1: In the course of time the probability distribution Pi(x,£) develops a 
second maximum around the position x 2 which is preferred by subpopulation 
2. That is, some individuals of subpopulation 1 become persuaded by the pre- 
ferred position #2 of subpopulation 2. The maxima of the opinion distributions 
P a (x,t) are given by the minima of the corresponding social fields V a {x y t). 
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Figure 9.4: Opinion distributions P a (x,t), social fields V a (x,t ), and diffusion 
coefficients Q a (x,t) for one-sided imitative processes of subpopulation a = 1: 
Since here the indifference L\ of subpopulation 1 with respect to the position 
held is large, most of its individuals take over the preferred position x^ of 
subpopulation 2 (of the trendsetters). The opinion distributions P a (x,t) are 
not only determined by the corresponding social fields V a (x,t) but also by 
the diffusion coefficient Q a (x,t) (by individual variations of the behavioural 
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Figure 9.5: Opinion distributions social fields V a (x i t) } and diffusion 

coefficients Q a (x> t) for mutual imitative processes of both subpopulations: For 
a small indifference L a of both subpopulations with respect to their preferred 
positions x ai a certain proportion of individuals becomes persuaded by the 
other subpopulation’s preferred position. This is the larger the larger the indif- 
ference L a of subpopulation a is. 






Figure 9.6: Opinion distributions P a (x,t), social fields V a {x,t), and diffusion 
coefficient Q a {x,t) for mutual imitative processes of both subpopulations: If 
the indifference L a of at least one subpopulation is large, the subpopulations 
will agree by means of persuasion processes on a common position (compro- 
mise) which is closer to the position x a that is preferred by the less indifferent 
subpopulation (here: a = 1). Although, after some time, the opinion distribu- 
tions P a (x,t) are almost identical the corresponding social fields V a (x,t) differ 
considerably due to the difference concerning the preferred positions x a . 
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9.4.2 Avoidance Processes 

In the following we will consider only avoidance processes (/*(f) = 0, 
/ a 2 (t) = 1). With 

and := 55 (9 34) 

let us assume that both subpopulations a prefer almost the same position x a . 
Figure 9.7 shows the case 

(MO) : = ( o i ) . i-e. = ( J J ) • (935) 

So individuals of different subpopulations do not like each other. This corre- 
sponds to a mutual influence of both subpopulations on each other. The results 
of the computer simulations show that 

• the individuals avoid the positions x which exist in the other subpopula- 
tion, 

• the subpopulation a = 1 with the smaller indifference L\ < Li holds 
positions which are close to its preferred position x\ and pushes away the 
other subpopulation from its preferred position £2- 

Although the initial distribution P a (x, 0) agrees in both subpopulations a, after 
some time there is almost no overlap of the behavioural distributions P\(x,t) 
and P2(x,t) any more. This is typical for social polarization phenomena. 
Familiar examples for this are the development of ghettos or the formation of 
hostile groups. 

Figure 9.8 assumes the case that individuals of subpopulation 2 like those of 
subpopulation 1 and thus do not react on behaviours within subpopulation 1 
with avoidance processes: 

(MO) == ( 1 J ) > i-e- (/«*( 0 ) = ( 0 l ) • ( 9 36 ) 

Consequently, the distribution P2(x,t) does not change with time whereas 
Pi(x,t) drifts further and further away from the preferred position x\ due 
to avoidance processes. Surprisingly enough, the polarization effect is much 
smaller than in Figure 9.7 according to the considerable overlap of the distri- 
butions P\(x, t) and P^x, t). The reason for this is that the gradient of P2(z> 0 
is much smaller than in Figure 9.7 and, in addition, it stays constant. As a 
consequence, the frequency with which an individual of subpopulation 1 meets 
disliked individuals of subpopulation 2 can hardly be decreased by a small 
change of the position x. From this we could draw the conclusion that polari- 
zation effects (which often lead to an escalation) can be reduced if individuals 
do not return dislike with dislike. 
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Figure 9.7: Opinion distributions P a (x , *), social fields V a (x , t ), and diffusion co- 
efficients Qa^x^) for mutual avoidance processes of both subpopulations: The 
individuals avoid the positions x which are held within the respective other 
subpopulation. As a consequence, after some time the opinion distributions 
Pi(x,t) and P 2 (x,t) do not overlap any more so that we can observe a polar- 
ization. In addition, the subpopulation a = 2 with the larger indifference L 2 
is pushed away from its preferred position X 2 . 
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Figure 9.8: Opinion distributions P a (x,t) for one-sided avoidance processes of 
subpopulation 1: Whereas subpopulation 2 keeps its behavioural distribution, 
the individuals of subpopulation 1 avoid the positions which are held within 
subpopulation 2. The polarization effect is much smaller than for reciprocal 
avoidance processes in the case of mutual dislike (cf. Fig. 9.7). 



10 Evolutionary Game Theory 



10.1 Introduction 



For a long time game theoretical approaches have been used for a quantita- 
tive description of the competition or cooperation of interacting individuals. 
These go back to VON Neumann and Morgenstern [196] and meanwhile they 
are also empirically validated [46,233,230]. In game theory two aspects stand 
in the centre of interest — the individual attempts to maximize the perso- 
nal success and the dynamics resulting from this in connection with the 
interactions between the individuals. 

It turned out that, especially for the understanding of the evolution of coope- 
ration among ‘egoists’, a repeated interaction and therefore the time dimension 
is of great importance (‘shadow of the future 5 ) [13]. Therefore, a multiple 
repetition of single game dynamical moves has been suggested (‘iterated ga- 
mes’). Meanwhile, however, the game dynamical equations find more and 
more resonance [137,139,256,271,303]. These are differential equations and 
thus continuous in time which will be more realistic at least if there are no 
separate, clearly defined moves at well-defined points in time. An interesting 
application of the game dynamical equations is the modelling of the spontane- 
ous self-organization of behavioural conventions in the case of equivalent 
competing behavioural strategies (cf. Sect. 10.4.1). 

The game dynamical equations originated from the theory of evolution [137, 
271,303] where their application turned out to be very successful. Therefore, 
in connection with the continuous formulation, we also speak of evolutionary 
game theory. However, in contrast to the theory of evolution where the equa- 
tions can be derived from the laws of genetics (cf. Ref. [138], pp. 43ff), for their 
application in the social and behavioural sciences up to now a ‘microscopic’ 
foundation is still missing. Such a foundation of the game dynamical equations 
on the basis of individual behavioural principles will be introduced in Section 
10.2.4. It connects the game dynamical equations with the BoLTZMANN-like 
equations for behavioural changes which were developed in Chapter 7. In order 
to establish this connection we have to interpret P a (i, t) as the probability with 
which the (behavioural) strategy i is pursued within subpopulation a. Then 
Pab(i f ij'\h j]t) describes the probability of two individuals to change their stra- 
tegies due to pair interactions. Therefore, the remaining problem is to specify 
the corresponding transition rates •= Vab(t)Pab(i ' in 

dependence on the expected successes of the strategies. 
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10.2 Derivation of the Game Dynamical Equa- 
tions 



10.2.1 Payoff Matrix and Expected Success 

Let E a i(i,j;t) be the success of strategy i for an individual of subpopulation 
a in an interaction with an individual of subpopulation b pursuing strategy j. 
Furthermore, let 

( 10 . 1 ) 



denote the relative meeting frequency of an individual of subpopulation a 
with individuals of subpopulation b. Then the expected success of strategy 
i is obviously given by 




i? a (i,£) r<ib(t)Eab{h j \ 0 

b j 



(10.31a) 



The matrix 






(10.31b) 



is often called the payoff matrix. Normally, the meeting rate i/ a b(t) = v a b{t)Nb 
of an individual of subpopulation a with individuals of subpopulation b is pro- 
portional to the relative frequency N^/N of individuals of subpopulation b: 



Vab(t) = = Ht)N h . 



In this case we have 




e 



(10.3a) 

(10.3b) 



10.2.2 Customary Derivation 

For the temporal evolution of the proportion P a {i,t) of strategy i within sub- 
population a we expect a law of the form 



^ Pa(i , t) = A a (i, t)P a (i, t ) 



(10.4) 
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(cf. (8.26a) and Sect. 2.5.2) where the growth rate A a (z,£) depends on the 
expected success E a (i,t) of strategy i. Due to the normalization condition, 
A a (i,t) must be chosen in such a way that 

j t e p a (i, o = E j t p •(*'» *) = 0 ( 10 - 5 ) 

i i 

is fulfilled. We immediately obtain a law of the desired form if we assume that 
the growth rate is proportional to the difference between the expected success 
E a (i } t) of strategy i and the average expected success 

( 10 . 6 ) 



A .(*, t) := v a (t) [£,(*, t) - <£«)] • (10.7) 

Then the proportion P a (i, t) of the strategies i with an expected success jE7„(i, t) 
which exceeds the average ( E a ) will grow whereas the proportions of the remai- 
ning strategies will decrease. The resulting equations 



( 10 . 8 ) 



are known as the game dynamical equations. 

10.2.3 Fields of Application 

Game theory has many interesting applications in economics (e.g. in oligopoly 
theory [269,280]) as well as in the social and behavioural sciences (e.g. in 
connection with the ‘prisoner’s dilemma’) [12, 13, 139, 175, 196, 234, 252, 253, 
283]. However, the game dynamical equations also play an important role in 
biology (evolution theory) where they are often called replicator equations 
[139,255,262]. A generalization of (10.8), 

= E [^a(i\i'-,t)Pa{i',t)-w a (i'\i;t)P a {i,t)^ (10.9a) 

i 1 

+ v a (t)P a (i,t)[E a (i,t)-{E a )\ , 





(10.9b) 
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is known as the selection-mutation equation ( [139], pp. 249f, [72], pp. 230ff) 
or as the Eigen-Fisher equation [60,78]. (10.9b) can be understood as the 
effect of selection (if we interpret E a {i^t) as the fitness of strategy i). In 
contrast, (10.9a) describes the effect of spontaneous mutations. Whereas the 
relevance of equation (10.9) for the theory of evolution is obvious [61,72], 
in game theory the mutation term can be used for modelling trial and error 
behaviour by spontaneous strategy changes. 



10.2.4 Derivation from the BoiTZMANN-Like Equations 



For behavioural changes we have developed BoLTZMANN-like equations (cf. 
Chap. 7). Consequently, we expect a connection between these equations and 
the game dynamical equations. A comparison of equations (10.9) with the 
BoLTZMANN-like equations 

J t p a (i, t) = Y, [“'VI*'; 0 - “’VI*; o] (io.ioa) 

V 

with 

w a (i\i'\ t) := iu a (*|* 7 ; t) + v a (i , i 7 ; t)R a (i\i f ; t) (10.10b) 

and 

v a {i, i'; t) M0 [ (/a\(0~/?i (<)) p b{h <)+ (/a»(0“/?»(0) P V> 0+/f»(0] 

(10.10c) 

shows that the (generalized) game dynamical equations (10.9) can result from 
these only for 



"iiW = Vob{t)fl h {t) := v a {t)6 ab 



and 



f ab {t) = 0 = f ai (t) 



( 10 . 11 ) 



Thus for the game dynamical equations, apart from spontaneous transitions, 
only imitative processes are of importance. Avoidance processes or compro- 
mising processes have no influence. In order to specify the interaction between 
the individuals in an appropriate way we argue as follows: 



• When an individual meets another individual of whatever subpopulation, 
the expected success E a (i, t ) of the pursued strategy i is evaluated: An in- 
dividual of subpopulation a meets individuals of subpopulation b at a rate 
^a&(0- With probability Pb(j,t) the interaction partner from subpopula- 
tion b pursues strategy j. The success of the individual of subpopulation 
a in the corresponding interaction will be Consequently, the 

evaluation of the expected success 

Ea(i, t) = ^2 r «b{t)E ab (i,r, t)P b (j, t) 
b j 



(10.12a) 
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with 

r ah (t) = ” ab(t ] s (10.12b) 

c 

is actually possible. Of course, the expected success is normally not cal- 
culated by the individual but rather it is estimated from personal expe- 
rience [228,229,251]. 

• Furthermore, when an individual meets individuals of the same subpo- 
pulation (who are cooperative due to common interests), the expected 
successes E a (i,t) of different strategies i are compared with each other 
by exchange of experiences. With a certain probability pJjO’KjO the 
strategy j of the interaction partner is imitated. The readiness R a (j\i\ t) 
to give up strategy i in favour of strategy j will be the larger the larger 
the increase 

A iiEa-Eaij^-Eafrt) (10.13) 

of the expected success is. However, if an adoption of strategy j were 
connected with a decrease of the expected success (A jiE a < 0), no imi- 
tation would take place at all. In summary, a plausible approach for the 
readiness to change the strategy from i to j is 



R a (j\r, t) := max ( E a (j , t) - E a (i, t), o) 



(10.14) 



It describes an individual optimization or learning process. 

• When an individual meets individuals of other subpopulations (who are 
assumed to behave in a competitive way), normally no imitative proces- 
ses will take place: During these interactions the expected success Eb(j,t) 
of the competitor can at best be estimated by observation since he/she 
will not tell his/her experiences. Moreover, there might be 

E a {j y t)^E b {j,t) (10.15) 

due to different criteria of success. That is, the expected success E b (j, t) 
of strategy j for an individual of subpopulation b does not admit the con- 
clusion about the expected success E a (j, t) of the same strategy for an 
individual of subpopulation a. As a consequence, an imitation of the stra- 
tegy of an individual of subpopulation b by an individual of subpopulation 
a(/ 6) would be very risky since it would probably be connected with a 
decrease of the expected success. Hence the assumption 

flb{t) w 0 for b^a (10.16) 

will normally be justified. 
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Relation (10.16) also results in cases where the individuals of each subpo- 
pulation a cannot imitate the strategies of the respective other subpopu- 
lations b because these are not available for them (i.e. because of different 
strategy sets). 

Now, by inserting (10.11) and (10.14) into (10.10), we obtain the desired (ge- 
neralized) game dynamical equations 



(10.17) 



because of 

max (^E a (i,t)- E a (i' - max ^E a (i', t) -E a (i, 0, o) = E a {ht) — E a {i^t ) . 

(10.18) 

We can conclude that the game dynamical equations (10.9) result from the 
BoLTZMANN-like equations for a special case of imitative processes . In order 
to define the quantities v a {t) — P a a(0/ia(0 an d £u(M) in a unique way, we 
finally put 

which implies z/ a (£) — ^aa(0 (10.19) 

so that the rate v a (t) is determined by the meeting rate v a a{t) within subpo- 
pulation a. 

By the way, the approach 

R a (j\i\ t) := E a (j , t) with E a (j, t)>0 (10.20) 

would also lead to the game dynamical equations (10.9). However, in contrast 
to (10.14), it would assume a considerable readiness for strategy changes even 
if these were connected with a decrease of expected success. Thus (10.20) is not 
very plausible. 

10.3 Properties of Game Dynamical Equa- 
tions* 

In the following we will list some important properties of the game dynamical 
equations (cf. Ref. [139], pp.!24ff). 





*This section can be omitted during a first reading. 
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10.3.1 Non-Negativity and Normalization 

As it is known from Section 2.4.1, 

P a {i,t)> 0 and ^P a (i,i) = 1 V< > t 0 (10.21) 

i 

is guaranteed for all BoLTZMANN-like equations (and therefore also for the 
game dynamical equations) if at a certain time to we have P a (Mo) > 0 and 
P a {i ) ^o) = 1 for all strategies i. The non-negativity and the possibility of 
normalization are preconditions for an interpretation of the functions P a {ht) 
as probabilities or proportions of the strategies i. 



10.3.2 Formal Solution 



For the game dynamical equations (10.9) we can find the formal solution 

(10.22a) 

with the matrices 

(10.22b) 

(cf. p. 299) and 

(10.22c) 
(10.22d) 

This solution satisfies the initial and normalization conditions. Further- 
more, by deriving with respect to time and applying the quotient rule (cf. 
p. 306) we obtain 

7 , p -^ 

_ L i'j 

V i',j 
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_ J ^ i 

= E (*&(*) + LiAtj)P*W,t) - E (^i'W + ^■'(0)^«(i , .0^(i,0 

= v a (t)E a (i,t)P a {i,t) - P a (i,t)'^2v a (t)E a (j,t)P a (j,t) 

j 

+ E LUWi', t ) - P a (i, <) E 0 

»' i'J 

= ^Lf i ,(t)P a (i , ,t) + v a (t)P a (i,t)(E a (i,t)-(Ea)J . (10.23) 

*' 

With (1.100) this means that the formal solution (10.22) fulfills the game dy- 
namical equations (10.9). 

10.3.3 Increase of the Average Expected Success in Sym- 
metrical Games 

For the weighted mean 

(10.24) 

of the average expected successes (E a (i,t)) the fundamental theorem 

(10.25) 

can be proved given w a (V\i\t) = 0, the time-independence of the successes 
(E a b(j, *; t) = E ab (j , *)), the symmetry 

(10.26) 

and 

r ab (t) = ^ (10.27) 

(cf. (10.3b)). By interchanging the variables of two interacting individuals a 
and /? we find 






Eab(J’yj') — j) — z) — E ba {j,l} . 



(10.28) 
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Hence we obtain 

d(E) 

dt 



= E^r + 

a \ i i ) 

= E T E £«(*'. t) (Mi, 0 - (Ea)) 

a i 

+ E^E p -^oE^^^>‘WA(j,o(^(i.o - (^>)(io.29) 

a i b,j 

-^-E ai {i,j)P b (j,t)u a {t)P a (i,t)^E a {i,t) - (£■„)) 

a »* &J 

+ EEjAO'-Oj ^aO'.O .°(^a(M)(^(M)-<^)) (10.30) 

<m 6 ' j 

= E ab (i,j) 

= ^^»%t)^Pa(i,t)E a (i,t)(E a (i,t) - <£ 7 a >) 



= 2E7^W((( £ «)V(£.> J ) 

a 

= 2 £ ^ a (0 £ t) - (Ea)) 2 Pa(i, t) 



> 0. 



(10.31) 

(10.32) 



From (10.29) to (10.30) we interchanged a and b as well as i and j. In the case 
of one single subpopulation ( A = 1) relation (10.31) states that the increase 
of the average expected success is proportional to the variance of the expected 
successes . 



10.3.4 Invariant of Motion for Antisymmetrical Games 



In the following let us consider the case of one single subpopulation (^4 = 1) 
and a time-independent antisymmetrical success matrix 



(10.33) 



Then we find for the average expected success 



(E(i,t)) = -(E(i,t)) = 0 



(E(i,j))=-(E(J,i)) 



(10.34) 
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and (10.8) simplifies to 



|p(i, t) = u(t)E(i, t)P(i , t) = v(t)P(i, t) £ j)P{h t) • (10.35) 



If P° = (P°(ii), . . . , P°(is)) tr is a stationary point of (10.35) with 



5>(i,j)p o (j) = o 



Vi, 



the quantity 



C :=HP(j,t) p0 ^ 



is time-independent, i.e. an invariant of motion due to 

§ = 

= v(t)Cj2P°(i)E(i,j)P(j,t) 

= - V (t)c'EPv,t)'EEu,i)p , >{t) 

j * 

= 0 . 



(10.36) 



(10.37) 



(10.38) 



Because of the invariant of motion (10.37) and normalization condition (10.5), 
the trajectory P(t) = (P(ii, £), . . . , P(«s, t)) tr moves on a (5 — 2)-dimensional 
hypersurface. Concrete examples for this were discussed in Section 8.3.2. 



10.3.5 Interrelation with the Lotka-Volterra Equations 
The general Lotka-Volterra equations [172, 173,282] have the form 

(i €{!,...,<?}). (10.39) 




They are particularly often applied in population ecology for the descrip- 
tion of food chains (predator-prey systems) (cf. Refs. [91,109,139,219]). 
Ui(t) > 0 denotes the size of population i . The constant Aio describes the 
rate of change of population i in the absence of other populations whereas the 
constants Aij reflect the influences of populations j on the temporal change of 
population i (via predator-prey relations or symbioses). 
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For the Lotka-Volterra equations of 5 variables 2/i(r), . . . , y s ( r ) it can be 
shown that they are mathematically flow- equivalent to the game dynami- 
cal equations of ( S + 1) variables P(0, £), P(l, £),..., P(5> t) (cf. Ref. [139], 
pp. 134f). In order to prove this, we first transform the 5 equations (10.39) 
with 

A 0 j := 0 and y 0 ( r ) = j/o(^o) := 1 (10.40a) 

into the (5+1) equations 

= j/,(r) • (i G {0, 1,...,5}). (10.40b) 



These equations result from the game dynamical equations 

j t P(i, t) = vP{i, t) [E(i, t) - (£ 7 )] (i € {0, 1, . . ., S}) 
by means of invertible variable transformations: Let 



be the relative population sizes. Then, due to j/o( r ) = 1, we find 



(10.41) 



(10.42) 



P( 0,t) = -jd (10.43) 

&.(<) 



and thus 





»(*) _ „ , t s _ P (M) 
yo(t) p(o,t)- 


(10.44) 


The application of the quotient rule and of (10.41) leads, after a few conversions, 

l(mo) = mo' , '[ £(i ' ,) - £:(0 ’ ,, l' (10 ' 45) 


(10.45) implies with 


Aij := vA{i,j) 


(10.46a) 


and due to 


s 

E(i,t ) = Y^A(i,j)P(j,t), 


(10.46b) 



j = o 

5 



E(o,t) = 5>(o,j)P(j,t) = o 

j = o 



(10.46c) 
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the relation 




= w(0 • 



s 



j=0 



= Vi(t ) 



5 



Ajj yj(t) 

j=o 



p(o ,0 



(10.47) 



With the time scaling (velocity transformation) 

t 

r(t) := J dt' P(0,t') (10.48) 

to 



we finally obtain (10.40). 



10.3.6 Limit Cycles and Chaos 



The special game dynamical equation (10.8) with i >{t) = v and the payoff 
matrix 



A 



0 0 -k 1 \ 

1 0 0 -K 

-k 1 0 0 

0 -k 1 0 / 



(10.49) 



has the stationary point P° = (1/4, 1/4, 1/4, l/4) tr . A linear stability ana- 
lysis with 8P(i , t) := P(i, t) — P°(i) •< 1 results in 



~8P{i,t) = Y^M ij 8P{j,t) 
j 



where the JACOBian matrix 



M = 




f — 1 + K — 1 + K — 1 — K 

1 + K — 1 + K — 1 + K 
— 1 — K 1 + K — 1 + K 
{—1 + K —1 — K 1 + K 



1 + K ^ 

— 1 — K 

— 1 + K 

-1 + K / 



(10.50a) 



(10.50b) 



Ai - — (— 1 + k) , 


(10.51a) 


A 2 = ^(-1-k), 


(10.51b) 


A 3 = ^(« + i), 


(10.51c) 


II 

1 


(10.51d) 



possesses the eigenvalues 
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If k £ [—0.5, 0.5] is varied from negative to positive values, Aj and A 2 will 
always be negative whereas the complex conjugate pair A 3 , A 4 will cross the 
imaginary axis from the left half-plane to the right one. Hence the fixed point 
P° is stable only for k < 0. For k = 0 one finds a Hopf bifurcation, and for 
small values k > 0 one discovers a limit cycle (cf. Sect. 5.5 and Ref. [138], 
pp. 178f). 



Game dynamical equations (10.8) can even show a chaotic temporal evolution 
[7,279]. For example, this is the case for the payoff matrices 



and 



( 


0.0 


0.5 


- 0.1 


0.1 \ 




1.1 


0.0 


1 

o 


o 

o 




-0.5 


1.0 


o 

o 


o 

o 


\ 


1.7 


- 1.0 


1 

o 


0.0 ) 



A 



0.000 0.063 

0.537 0.000 

-0.535 0.380 

0.536 -0.032 



0.000 0.437 \ 

-0.036 -0.001 
0.000 0.655 

-0.004 0.000 / 



(cf. Ref. [248]). 



(10.52) 



(10.53) 



10.4 Stochastic Version of the Game Dynami- 
cal Equations 



Due to the fact that the game dynamical equations are, according to Section 
10.2.4, a special case of the BoLTZMANN-like equations we can, in connection 
with Chapter 3, immediately formulate a stochastic version of the game dyna- 
mical equations (cf. Refs. [72], pp. 207ff, [58], pp. 239ff). This is given by the 
configurational master equation 



-^P(n, t) — ^ w(n\n f ; t ) — w(n f \n; £)P(n, 0 



with the transition rates 











w a (i'\i]n\t)n^ 


CJ-w 

52 

II 

"62 


T 

II 




if n 1 — n a & a & 
nn - n^yij 




0 


otherwise 



(10.54a) 



(10.54b) 
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(cf. (3.50)) ( w a = w a , w a b N =^ 1 w a b/Nb). Here for w a b(i f we use 

the approach (7.52a) for imitative processes: 

(10.54c) 

Particularly, for the special imitative processes of game dynamics we have 

(10.54d) 

and 

(10.54e) 

with 

(10.54f) 

(cf. (10.11), (10.14), and (10.31)). 

The stochastic formulation of the game dynamical equations allows, on the one 
hand, the calculation of the probability P(n, t) with which possible socioconfi- 
gurations n (i.e. strategy distributions) occur. On the other hand, it allows the 
derivation of covariance equations which are necessary 

• for the calculation of the reliability of game dynamical results which 
is given by the standard deviations y/cr aa , 

• for the determination of the period of time during which game dyna- 
mical equations (10.9) can be interpreted as approximate mean value 
equations (cf. Sect. 3.3). 







Moreover, corrected mean value and covariance equations (cf. Sect. 3.4) 
become derivable. For this purpose, let us introduce the following abbreviations: 



:= w a {i\i'\ n; t) - ^ w a (j\i'; n, t) , 

j 


(10.55a) 


:= w a (i\i'-, n, t ) + <5,-,< w a (j\i'] n; t ) , 

j 


(10.55b) 


= i/ a (t)\R a (i\i';n;t) — R a (i' \i; n; t)] , 


(10.55c) 



Miff := v a {t) [iP , (i|i , ;n;<) + « o (i / |i;n;0 



(10.55d) 
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Then, with (3.15) and (3.24), we obtain 



( r a-/ a\ I M a-( n i)( n i') 

2_s [ L H' \ n i'/ + M U' M 






+ SEE-frE 



r- i n i){ n i') 






a b ( dLfc , dM- «)^ 



w> »w) j 

I r Wr 4i ( Kf a ~ Sab I V i n i') 



as approximate mean value equations (s = 0) and corrected mean value 
equations (s = 1) respectively (cf. (3.16) and (3.47)). The game dynamical 
equations correspond to the approximate mean value equations (10.56a). Howe- 
ver, according to Section 5.1, they can also be interpreted as equations for the 
most probable individual behaviour. The interpretation as approximate 
mean value equations is only possible as long as the variances a aa are small 
(cf. (3.20)). In doubtful cases one should rather use the corrected mean value 
equations (10.56) which themselves depend on the covariances cr a $ and thus 
take into account the fluctuations which result from the stochasticity of the 
described processes. However, the corrected mean value equations require the 
solution of the covariance equations. As approximate covariance equations 
(s = 0) and corrected covariance equations (s = 1) respectively, we find 
(cf. (3.22) and (3.48)): 





242 



10. Evolutionary Game Theory 



+ ^2 a i'l 



c,k 



y (aM) ( i! «(»j) at. ) +< “ p* +JW “ nT 



. ah yp ( M a- ( n j ) . ,ri- ( n j) 

+ i> 2_s 1 M a — + M i'i -jyj- 



+ 6, 



ab \ vH' 



N a 



(10.57a) 



d 2 L“+ 



c,d k,k' 



d{n c k )d(nl) 



K) + 



d 2 M»+ ( n?)(n ]) ' 

d(nl)d{ni,) N a 



+ s 2I^i 

c,k j 



t yKi 



+ s 



ac ( gM »7 + ( n j) . M a+fr»c 
V lV a + M ‘* JV a 



f d 2 w a (i,i';t) t _ as , 0 2 u; a (i , |i;*) / _ ax 

^ U»sw*> M + www) {i) 



+ o 



d 2 M°+ («?)(»?) 



c,d Jk,ib 



' kk> d(n c k )d(n d kl ) N a 



+ s 



+ s 



c,Ar X 

( /T aC 



dtu a (j'|i;l) ac au) a (i|i';t) 
1- 



9{n c k ) 

dM?+ (n?,) 

,) N a 



d{n%) 



+ ff i'k o/_c\ nr + °< 






IVa 



(10.57b) 



An example for the application of these equations and of the game dynamical 
methods is given in the next section. 



10.4.1 Self-Organization of Behavioural Conventions for 
the Case of Two Equivalent Competing Strategies 

In the following let us consider the case of one single subpopulation ( A — 1). 
Then the subscripts and superscripts a, 6, c can be omitted. (A socio-biological 
example for game dynamical equations for two subpopulations can be found in 
Ref. [139], pp. 137ff.) 

If the considered individuals have the choice between only two strategies i £ 
{1,2}, this implies 



n 2 = N — n\ 



(10.58) 
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where N denotes the number of individuals. Furthermore, for equivalent stra- 
tegies the payolf matrix A is symmetrical and of the form 



A = 




A + B B 
B A + B 



(10.59) 



By scaling the time we can achieve 

v(t) = v 



(10.60) 



With 



w^i'lv.n^t) :—W 



(10.61) 



we will assume the simplest case for the transition rates describing spontaneous 
strategy changes. 



An example for the above mentioned situation is the avoidance behaviour of 
pedestrians (cf. Fig. 10.1): In crowds of pedestrians who walk into opposite 
directions, avoidance manoeuvres fail the less frequently (i.e. have the larger 
expected success) the larger the number n t - of pedestrians is who swerve to 
the same side i. Thus in the payoff matrix (10.59) there is A > 0 because it 
is advantageous to apply the same strategy as the interaction partner. Conse- 
quently, we expect a symmetry breaking via a phase transition. In fact, 
according to experience, in Germany and some other countries pedestrians pre- 
fer the right-hand side (i 1) [204,205]. With the same probability, however, 
a preference of the left-hand side (i 2) could have won out which is a priori 
completely equivalent. Since in some countries this case seems also to be given, 
an explanation of the symmetry breaking by the asymmetry of the human body 
must be discarded. 



Some people argue that the reason for the asymmetrical avoidance behaviour of 
pedestrians could be the habits regarding road traffic. In this case, however, a 
similar model as above would be required to explain why in some countries left- 
hand traffic is prescribed (e.g. in Great Britain) and in others right-hand traffic 
(e.g. on European mainland). Here we can assume that the legislator adapts to 
once established norms or conventions. Very probably these conventions first 
developed at pedestrian traffic since the avoidance problem is much older there 
than at vehicular traffic. 



Apart from this, in many countries of the world (e.g. India) vehicular traffic 
plays a completely subordinate role compared to pedestrian traffic, not to men- 
tion that the fewest pedestrians have a driving licence there. At least in these 
countries it is improbable that pedestrians behave according to the perspective 
of a car driver. 



The model for the self-organization of behavioural conventions is not limited 
to the avoidance problem. It can rather be applied to a number of other ca- 
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ses where it is advantageous for the majority to choose the same behavioural 
alternative. Some examples are discussed on pages 4f. 




Figure 10.1: (a) Avoidance problem of pedestrians: For pedestrians who walk 
in opposite directions it will be advantageous if they swerve both to the right 
(solid arrows) or both to the left (dashed arrows). Otherwise they are forced to 
stop in order to avoid a collision. The probability P( 1) to swerve to the right is 
(in Germany and some other countries) normally greater than the probability 
P(2) = 1 — P(l) to swerve to the left. 

(b) In dense pedestrian crowds lanes develop in which one of the two oppo- 
site walking directions (represented by black and white circles respectively) 
dominates. So the frequency of avoidance manoeuvres is reduced. These are 
indicated by arrows. 



For the above described situation the stochastic version of the game dynamical 
equations has the explicit form 



— P(n u n 2 ;t) = 



+ 



with 



w(l|2) + vR{\\2] ni - 1, n 2 + 1; t) 
xP(n i - 1 ,n 2 + 1;<) 
w(2|l) + i/i?(2|l;ni + l,n 2 - 1;<) 



fij — 1 



N 

n 2 — 1 



N 



(«2 + 1 ) 
("1 + 1 ) 



xP(m + 1, - l;<) 



u>(l|2) + j/fl(l|2;7n,n 2 ;t)^i] n 2 P(ni,n 2 ;t) 
w(2\l) + i/R(2\l]n h n 2 ]t)jjr n 1 / > (m,n 2 ;0 (10.62) 



R(j\i] n; t) max n; t) - E(i, n; t), o) (10.63) 
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and 

E(i,n,t):=^2A(i,j)^. (10.64) 

3 

For the calculation of the related mean value equations (10.56) and covariance 
equations (10.57) we require 



L7>> = 


W-Sii'SW, 


(10.65a) 


L+ = 


W + Su>SW , 


(10.65b) 


Mr., = 


v[E(i,t)-E(i',t)}, 


(10.65c) 


M+ = 


v\E{i,t)-E(i\t)\ 


(10.65d) 


E(i, t) 


3 


(10.66) 



In particular, the approximate mean value equations (and thus the game dy- 
namical equations) have, due to P(2,£) = 1 — P(l,f), the explicit form 





W + vAP(i,t)(p(i,t)-ty 



(10.67) 



Obviously, P(z, t) = P°(z ) = 1/2 is a stationary solution of (10.67) which des- 
cribes a situation in which both strategies are chosen with the same frequency. 
This solution will be only stable if the parameter 



k := 1 — 



4 W 
vl 



( 10 . 68 ) 



fulfils the condition k < 0, i.e. if spontaneous strategy changes are so frequent 
that no strategy can win out. 

For k > 0 the stationary solution P°(i) = 1/2 is unstable and the game dyna- 
mical equations (10.67) can be represented in the form 

±PC,t) = -2(p(i,»-\) (p(M)-i^) ■ 

(10.69) 

Consequently, for k > 0 there exist two additional stationary solutions P°(z) = 
(1 + \//c)/2 and P°(i) = (1 — \//c)/2 which are stable. That is, either strategy 1 
or strategy 2 will gain a majority of 100 • y/H percent which essentially depends 
on the (random) initial condition P(l, to) = 1 — P(2, to). This majority will be 
the larger the smaller the rate W of spontaneous strategy changes is. 
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For /c = /c 0 := 0 there obviously occurs a phase transition. This becomes 
particularly clear in Figure 10.2 which shows, for A\— 1 and various values of 
/c, the temporal course of the probability P{n\ ) n 2 \t) = P(ni,N — n\\t) with 
which rii individuals pursue strategy 1 and = N — n\ individuals apply 
strategy 2. (For illustrative reasons we started from a broad initial distribution 
which corresponds to a situation in which the initial state is not exactly known. 
With the unrealistically small number of N = 40 individuals we achieve that 
the distribution P(rai, N — n\ \ t) remains relatively broad in the course of time. 
With increasing population size N the maxima of the corresponding probability 
distribution become more and more distinct. In addition, note that Figures 10.2 
and 10.6 are scaled in such a way that the maximum of P(n \ , N — n \ ; t) reaches 
half of the box height.) 

From Figure 10.2 we can conclude the following: 



• For k < 0 the probability distribution P(n\,n 2 \t) will have only one 

maximum at n\ = — Nj 2: That is, the individuals are very likely to 

pursue both strategies with the same frequency. 

• For k — Ko = 0 there occurs the phase transition. Due to critical fluc- 
tuations (cf. Sect. 5.4) the probability distribution P{ni,ri 2 \i) becomes 
very broad. 

• For /c > 0 the probability distribution P(n\,ri 2 \t) becomes multimo- 
dal: It is very likely that one of the strategies will gain the majority of 
individuals. In the short term the initial condition decides which strategy 
has the advantage. However, since for t — ► oo the maxima are located 
symmetrically with respect to N/ 2, in the long term both strategies have 
the same chance to win over the majority (cf. Sect. 5.3). 

For growing values of k the maxima of the probability distribution 
P(n i,n 2 ]t) which describe the chance that one of the strategies will win 
out become more and more distinct. 



Figures 10.3 to 10.5 compare the exact course of the mean value (ni) = (ni) t 
and the variance (T\\ = a n(t) of n\ ( — ) with the solutions of the corresponding 
approximate mean value and variance equations (• • • ; cf. (10.56a), (10.57a)) as 
well as with the solutions of the corrected mean value and variance equations 
(--; cf. (10.56), (10.57)). 



• For k <C «o = 0 the approximate as well as the corrected mean value and 
variance equations provide good approximations (cf. Fig. 10.3). 
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Figure 10.2: Illustration of the tem- 
poral evolution of the configurational 
distribution P(n\,N — n\\i) accord- 
ing to the stochastic version of the 
game dynamical equations for vari- 
ous values of the control parameter k. 
P(ni,N — ni;t) denotes the probabil- 
ity with which n\ individuals pursue 
strategy 1 and ri 2 = N — n\ individuals 
strategy 2. For k = 0 a phase tran- 
sition takes place: Whereas for k < 0 
very probably both strategies are ap- 
plied with roughly the same frequency, 
for k > 0 one of the strategies will win 
out with a high probability, i.e. a be- 
havioural convention will arise by 
social self- organization. 
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• Since the probability distribution P(ni,iV — n\\t) becomes broader for 
growing values of k, the solutions of the corresponding approximate and 
corrected equations deviate more and more from the exact solution, as 
expected. Because of (3.42), (3.43), they are valid only for the period of 
time during which the relative central moments 



(fa ? 1 -<"?*)) ••■ 

c (t\ — c 0i - am m — ' 1 1 




C m W •*m W — . 


•••«:> 



(10.70) 



fulfil the condition 



(10.71a) 



for certain well-defined values of m. The decisive values for m are 



< e' < 1 



- m = 2 for the approximate equations (cf. (3.42)) 

- m = 3 and m = 4 for the corrected equations (cf. (3.43)). 



As we can see, the curves of the approximate and corrected equations 
begin to deviate from the exact curves roughly at the time t when, for 
the first time, we have | C m (t)\ > 0.04 for the decisive values of m. Hence 
we put 

(10.71b) 



Unfortunately, the examination of the validity of condition (10.71) will be 
impossible for C m (t) with m > 2 if we do not evaluate master equation 
(10.62) but only the corrected mean value and covariance equations. From 
Figures 10.3 to 10.5 and 10.7, however, we can see that condition (10.71) 
will be fulfilled for the decisive values of m if we have 



e* := 0.04 



\C 2 (t)\ = 


<«?><»$) 


< e 


with 


e := 0.12 



(10.72) 



• Comparing (10.71) with (10.72) we expect that the corrected equations are 
valid for a significantly longer time period than the approximate equations 
(cf. Figs. 10.4 and 10.5). The corrected equations have explanatory power 
even for a rather broad probability distribution P(ni, N — n\ \ t) but they 
fail when the probability distribution becomes multimodal. This phase 
transition is preceded by a considerable increase of the covariances afj (t) 
connected with a violation of condition (10.72) in the course of time t . 

Nevertheless, the approximate mean value equation is relevant also for 
those values of k and those times t for which it lost its applicability 
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t 




0.00 £.50 5,00 7.50 10,00 

t 

Figure 10.3: Above: Exact ( — ), approximate (• • •), and corrected ( — ) mean 
values (above curves) and variances (curves below) for a small unimodal dis- 
tribution (/c < 0; cf. Fig. 10.2): The simulation results for the approximate 
equations are acceptable, those for the corrected equations very well. 

Below: Relative central moments C m (t) for the stochastic version of the 
game dynamical equations ( — : m = 2; — : m = 3; • : m = 4). 



as mean value equation — since, according to Section 5.1, it approximately 
agrees with the equation for the most probable behaviour of indivi- 
duals. 
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t 




t 

Figure 10.4: Above: The approximate (• • •) and corrected ( — ) mean values 
(upper curves) and variances (lower curves) show, for a broad unimodal dis- 
tribution (k = 0; cf. Fig. 10.2), deviations from the exact mean value and 
variance respectively ( — ). As expected, the results of the approximate equa- 
tions are worse than those of the corrected ones. 

Below: Relative central moments C m (t ) for the stochastic version of the game 
dynamical equations ( — : m = 2; — : m = 3; • • • : m = 4). A comparison 
with the figure above shows that the approximate mean value and variance 
start moving off from the exact course when Ci(t) exceeds 0.04 whereas for the 
corrected mean value and variance this is true as soon as we have C±(t) > 0.04 
and C*2(0 > 0.12 respectively. The approximate equations are valid for a con- 
siderably shorter time interval than the corrected equations. 
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t 




O.DQ 2.50 5.00 7.50 10,00 

t 

Figure 10.5: Above: For a multi-modal distribution (/c > 0; cf. Fig. 10.2) it is 
only for a short time period that the approximate ( • • •) and corrected ( — ) mean 
values (upper curves) and variances (lower curves) are a valid approximation of 
the exact mean value and variance respectively ( — ). However, the approximate 
equations still describe the most probable individual behaviour. 

Below: Relative central moments C m (t) for the stochastic version of the game 
dynamical equations ( — : m = 2; — \ m — 3; •••: m = 4). A comparison with 
the figure above shows again that the approximate mean value and variance 
deviate from the exact course for C^t) > 0.04, the corrected mean value and 
variance, in comparison, for C^t) > 0.04 and C^) > 0.12 respectively. As 
expected, the approximate equations are valid for a considerably shorter time 
interval than the corrected equations. 
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Modified Game Dynamical Equations 

Note that the probability distributions P(ni, AT — n\]t) in Figure 10.2 show 
a curious crease at n\ = N/ 2. This has its origin in the discontinuity of the 
derivative of R a (j\i; n; t) with respect to ( E a (j , n; t) — E a (i , n; t)) which results 
from the dependence on the maximum function (10.54e). With the approach 

(10.73a) 

that was motivated in Chapter 7 and the expected utility 

(10.73b) 

we can obtain results that will turn out more satisfactory than those of (10.54e). 
This approach also guarantees that transitions from strategy i to strategy j are 
particularly frequent if the expected increase of success A j{E a (or increase of 
utility AjiU a ) is large. If a decrease of success (or utility) is to be expected 
(A jiE a = AjiU a < 0), transitions will be rare. According to the discussion of 
Section 7.2.1, approach (10.73), in contrast to (10.54e), takes into account that 
during a decision on a strategy change the information about the exact value 
of the expected utility U a is usually incomplete. The decisions rather base on 
an estimate 

n b *(t) 

b j 
b j 

= U a (i,n;t) + e' a (i,n;t) (10.74) 

of the expected utility U a {i,n\t) which is evaluated from the successes of the 
preceding interactions. The random variations ej,(i, n; J) of the estimated utility 
{/* (i, n]t) result from the accidental variations of the relative frequencies 

(10J5) 

with which an individual of subpopulation a meets individuals of subpopulation 
b with strategy j (and possibly from other uncertainties). 

In addition to this, by variation of the distance functions D a (j, i; £), approach 
(10.73) allows to take into account the effort 5 a (j, i]t) = In D a (j,i\t) which 
is connected with a strategy change from i to j. In the case of small utility 
differences Aj,f7 a and (j , i\t) 2, by means of a Taylor approximation, 
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Figure 10.6: Illustration of the tem- 
poral evolution of the configurational 
distribution P(n\,N — n\\t) for var- 
ious values of the control parame- 
ter k according to the stochastic ver- 
sion of the modified game dynamic 
equations. Like in Figure 10.2, for 
k = 0 there occurs a phase transition 
which describes the self-organization of 
a behavioural convention. This time, 
the modified transition rates lead to 
smooth distributions and take into ac- 
count the incompleteness of informa- 
tion during the decision on a strategy 
change. 
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we again arrive at the game dynamical equations (10.9). From Figure 10.6 
we find for this case that the modified approach (10.73) yields similar results 
than (10.54e), especially a phase transition at k = 0 (cf. Fig. 10.2). However, 
a disadvantage of approach (10.73) is that, in order to guarantee the validity 
of the corrected mean value equation, condition (10.71) must be fulfilled for 
all m > 2 according to (3.43). This condition is difficult to check and, due to 
the fact that the exact mean value equation depends on moments of arbitrary 
order m, violated early (cf. Fig. 10.7). 




GUDO a, 50 5,00 7,5D 10.0Q 

t 



Figure 10.7: Relative central moments C m (£) for the stochastic version of the 

modified game dynamical equations ( — : m = 2; — : m = 3; :m = 4; : 

m = 5;---:m = 6). The corrected mean value and variance equations become 
invalid as soon as there is C m (t) > 0.04 for any m > 2 or C 2 W > 012. Due 
to the dependence of the exact mean value equation on moments of arbitrarily 
large order m, approximations for this are valid only for a short time interval. 





11 Determination of the Model 
Parameters from Empirical Data 



11.1 Introduction 



The decisive measure of the quality and applicability of a model is the compa- 
rison with empirical data. Hence we require strategies for the determination of 
the model parameters as {7 a (a;,£) or D a {x\x\t) from suitable data material. 
These mostly base on the method of least squares. Since available data 
material is not always complete , we will develop useful methods even for such 
cases. 

After the model parameters have been determined, a sensitivity analysis 
should be carried out. A test for the significance of the model parameters 
normally allows a model reduction. Furthermore, from the viewpoint of in- 
terpretation it is of interest on which factors the utility and distance functions 
of the considered behavioural changes depend. A corresponding decomposition 
with respect to explanatory variables is possible by means of a regression 
analysis in connection with a factor analysis or, alternatively, a ranking 
regression analysis. This is especially important for the prognosis of the 
future behaviour of the investigated system. Examples for the decomposition of 
utility functions into interpretable contributions will be discussed for purchase 
pattern and voting behaviour. 



11.2 The Case of Complete Data 



In the following let us mark empirical data, in contrast to model variables, by 
a subscript or superscript ‘e\ We speak of 6 complete data’ of a considered 
social process if we know all the following quantities 
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wl{x'\x]ti) 
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(11.1b) 


Kb (u) , 
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(11.1c) 
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for a sequence of times 



ti G {^0> • • • > • 



( 11 . 2 ) 



Here the time intervals A t{ t{ — must be considerably smaller than the 

reciprocals of the individual transition rates of behavioural changes, i.e. 



E w ai. x '\ x '^i) ^ ITT 



(11.3a) 



k *'(#*) y y' ’ 

(cf. (7.5)). 

Now a fitting of the model rates 



(11.3b) 



w a {x'\x\ti), «'£*(* / ,l/ , |*»v;<i) (11-4) 

can be achieved by minimizing the GAUSsian error function 



'»£££[ w l{x'\x\ ti)PZ(x, ti) - W a {x'\x] ti)PZ(x, <«•)] 

* a xjtx 1 

+ E E E E K^OpiiV. y; <0 A e (y> <0 

* fc a, 6 (ar,y)^(jr , ,y / ) 

- w ai( x '> y' \ x > v> *i)PZ( x , U)PZ (y, <»•)] 



(11.5) 

under variation of the model parameters (method of least squares; [81,88, 
19]). However, the maximum likelihood method could also be used [82, 
149]. 

For the model rates let us again choose approaches of the form 



w a (x'\x\U) := v a {U)- 



U^x'td-U^xti) 

Da (*® / > *5 ) 



( 11 . 6 ) 



!*»!/> ^*) — V a b(ti)P})a(?t\y\ ^i)^xx' ^xy 1 ^ “ $xy) 

+ Vab{ti)p a b{y\ x i ^i)^yy‘ <^ry) > (11.7a) 
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™ab( x > V 1*5 y\ ti) — ^ab(^i)Pab(^ I* 5 ^i)Pba{V 1*5 ^i)^xy ^xx' (1 ^xy 1 ) 

+ Vab(ti)pli,(x'\y,ti)pl a (y\y,ti)6 X y6yy'(l - 6y X >) 

+ v a b(U)p\ h {x'\x\ ti)pl a (y'\x , f,)«5 ty (l - 6 XX <) 



X(1 — 6 X yl) , 



(11.7b) 



«&(*', y'l*>y;*<) 

= l / ab{ti)Pab(x\x',ti)pb a (y |y! ^t')^rr'(l ~ 4 ei/)( 1 — ^yy')(^ — ^*y') 



+ I'aX^OPaX* I*) ti)Pba(y\y> — ^*yX^ ^xx')(l ^yx') 



+ » / at(<<)Pot(* , l a! ; t t - )p?o(y , |y. i <)( 1 - ^yX 1 - ^'X 1 - <V) 

X (1 $yx 1 ') 



with 






p k ab (x'\x;ti) := fab(ti) 



D a (x',x;ti) 

(cf. Chap. 7). Additionally, for reasons of uniqueness we demand 



~c X^ ® — o X^ ^ a (*’ **) 



and 



5(5^1) E/V*'. *;'<) = i ^ sisTij E ;«i) 

X^X X^-X 



(11.7c) 

(11.8) 

(11.8a) 

(11.8b) 



(cf. (7.37), (7.61)). Then the method of least squares will allow a deter- 
mination of the utility functions U a {x,U), U a (x,ti), the distance functions 
D a {x' ,x\U), D a (x' ,x-,ti), and the interaction frequencies /* t (t). A minimi- 
zation of the error function F with respect to the model parameters v a (U), 
: = KbWfabiti)’ U a (x,ti), U a {x,ti), Da{x',x\ti), and D a (x', a; <,) is 
in general carried out numerically, e.g. by means of the method of steepest 
descent [19,62,80]. 

If P^(x,t) is unknown, instead of using (11.28), we can determine the model 
parameters by applying the error function 

F' := [<(*'!*; t { ) -u»a(aj / |as;t i )] 

* a x^x 1 
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* k a,b (x } y)£(x‘ } y‘) 

(11.9) 

However, the transition rates are not weighted with the magnitude of their 
contribution to dP a {x ) t)/dt, then. Hence this method is not as suitable as a 
basis for prognoses. 



11.3 The Case of Incomplete Data 



For financial reasons and due to special difficulties with the data collection 
of pair interaction rates, one is often confronted with a poor data situation. 
Therefore, let us assume in the following that all we know are the quantities 

PZ(x,U) V*, (11.10a) 

wZ(x'\x;ti) Vx'^x (11.10b) 



for certain times ti 
i that 



€ This time we must presuppose for A ti — 



£ 

x'(£x) 



<(*'!*; u) < ^ 



(li.ii) 



is fulfilled (cf. (7.5)). We will try to describe w“(x'\x; ti) for x' ^ x by the 
model approach 



w a (x'\x;ti) 



Va{U) 



D a (aj / , x ; tj) 



from Chapter 7 where 



+ u a (x',x;ti) 






D a (x',x]ti) 

(11.12a) 



v a (x',x;ti) := X) [(^6^*) (*',<<) 
b 

+ (vli(U)-vl b (ti))Pb(x,ti) + Va b (ti)\ . (11.12b) 



In order to determine the functions j/ a (t,), !&(*.■)> Uaix.U), U*(X,U), 
D a (x' ) x;ti) ) and D a (x', x\U) in a unique way, we again demand that the 
constraints (11.8) are fulfilled. Due to linear dependences of some model para- 
meters, instead of vZb(t 0> an d one can only determine 






(11.13a) 
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»&(*<) : = U ai(U) ~ ^(ti) , 


(11.13b) 


and 




b 


(11.13c) 


However, since for A = 1 we have 




V 3 (ti) = l/ 3 (ti) 


(11.14a) 


and 






(11.14b) 


V 2 (ti) = V 2 (ti) + V 3 (U) , 


(11.14c) 


this problem will not occur if only one single subpopulation 
(A = 1) which is normally the case. 


a is considered 


Now our task is to determine the functions v a (U), ^(2*), 

U a (x,ti), U a (x,ti ), D a (x' ,x\ti), and D a (x' , x\ t{) from the ( n + 1)A5(5 — 1) 
data values w*(x'\x;ti) (x' ^ x) with t{ E x E {*i, . . . , * 5 }, 

a E {1, . . ., A}. Table III shows how many independent quantities belong to 
the different functions. 


Obviously, model (11.12) has 




(n + 1 )A [2 + 2A + 2(5 - 1 ) + 5(5 - 1) - 2 


(11.15) 


independent model parameters, i.e. it is totally underdeterminate by the 
(n -f 1)AS(S — 1) empirical data w* (x'\x; £,). Thus we introduce the additional 
assumption 

i 1 


D a (x',x;t) = Da(x',x-,t) . 


(11.16) 


Then there remain 




(n + 1)A [2 + 2A + 2(5 1) + S ^ S ~ ^ 1 


(11.17) 



independent model parameters. Assumption (11.16) is very plausible since it 
implies that the distance between (dissimilarity of) two behaviours x and x f 
agrees for spontaneous behavioural changes and behavioural changes due to pair 
interactions. On the basis of the distances D a (x , ,x;t) = D a (x f ) x]t ) one can 
construct an abstract behavioural space with the method of multidimensio- 
nal scaling [161,301]. This describes a part of the psychological topology 
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(psychological structure) of the individuals of subpopulation a since the 
distances reflect which behaviours are more or less related (compatible) (cf. Ref. 
[212], Chap. 6). With the dependence on a, the distance functions D a (x , y x m ) t) 
distinguish different psychological structures (personality profiles) which could 
be denoted as ‘characters’ and manifest themselves as different behavioural 
types. 

The model parameters can only be uniquely determined from the empirical 
data on the condition 



2A + 2S + — r — — — 1 < S{S — 1) , 



i.e. 



S > 



5 + y/YJ + 16 A 



(11.18) 



So, for the case of one single subpopulation (A — 1), at least 5 = 6 behaviours 
must be distinguished. 



If the data differentiate less behaviours, we can, for instance, make the further 
simplification 

(11.19) 



This assumes that the readiness R a (x f \x;t) for behavioural changes from x to 
x f due to pair interactions is the same as the corresponding readiness R a (x'\x; t) 
for spontaneous transitions. One consequence of (11.19) is that, instead of v a (t) 
and ^(J), we can only determine 



U a (x,t) = U a (x,t) 



( 11 . 20 ) 



so that even in the case of one single subpopulation (A = 1) the contributions 
by spontaneous transitions and compromising processes are no longer separable. 
However, since the resulting model 



V a {t) := t '„(<) + vl{t) 



w a (x'\x;ti) 



» •; «,) r (,,) + £ faMW’ 

b 



( 11 . 21 ) 



has only 



(n + l)A 



1 + 2A + (S-1) + - ^ — L) -1 



( 11 . 22 ) 



independent model parameters, these can be determined already for 

^ 3+vt+tm 



2 



(11.23) 
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Function 


Constraints 


Number of Indep. Variables 


<(*>;<«•) 


X* ^ X 


+(n + 1 )j 45 2 
-(n + l)AS 


Altogether 


+(n + 1M5(5 - 1) 


Va{ti) 


>o 


+(n + 1)^4 


4(ti) 


>o 


+(n + l)A 




>-«&(*.■) 


+(n + 1 )A 2 


"abiti) 


> -vlb(U) 


+(n + 1 )A 2 


U a (x,U) 


X 


+(n + l)AS 
-(n + 1)A 


Altogether 


+(n + l)A(S-l) 


U a (x,U) 


!5>“0M) = o 

X 


-f-(n + 1)A(5 — 1) 


j 35 j ti) 


> 0, x' ^ * 

D a (x, ti) = D a (x f , x\ ti) 


+(n + 1)AS(S - 1) 
. ,,.5(5-1) 

-(n + lM v 2 ; 

— (n + 1)^4 


5(5-1) ^ x , t} ~ 


Altogether 


+(« + lM ( g(5 ~ 1) - 1 ) 


D a (x',x;ti ) 


Analogously 


+ (n + lM( S <V 1) -l) 



Table III: Number of independent variables of the empirical data and the func- 
tions occuring in the behavioural models. 



























262 



11. Determination of the Model Parameters from Empirical Data 



That is, for one single subpopulation {A — 1) we must distinguish at least 5 = 4 
behaviours and for two subpopulations ( A = 2) at least 5 = 5 behaviours. 

A remarkable aspect of equation (11.21) is that it implicitly takes into account 
compromising processes by effective rates T%{U), ^6 an d u ab^*) f° r s P on " 
taneous, imitative, and avoidance processes respectively. As a consequence, a 
separate discussion of compromising processes is often not necessary. 

A further drastic simplification is, for example, given by the assumption of 
time-independence of the distance functions D a (x* , x;t): 



D a (* , 35 j £) 



S a (x' y x,t ) S a (x l ,x) 

e = e 



D a (x',x) 



Then there remain 



(n + l)A [l + 2 j4 + (5 — 1)] +A 



\S(S - 1) 



-1 



(11.24) 



(11.25) 



independent model parameters which can be determined in the case of a suffi- 
cient number (n + 1) >> 1 of time steps if 



5 > 1 + y/l -K 2 A . 



(11.26) 



In general it is not reasonable to choose a model which contains as many pa- 
rameters as one just can determine from the data. On the one hand, the ex- 
planatory power of the single parameters is small, then. On the other hand, 
surplus parameters often describe only accidental variations of the data in- 
stead of the significant systematic interrelations one is interested in (cf. 
Sect. 11.4.2). 

11.3.1 Parameter Estimation* 

Now the particular way of proceeding for the determination of the model pa- 
rameters will be discussed for the model 



W a {x'\x- ) t i ) := v a {ti)- 



V.(x‘ ,ti)-V a (x,U) 
D a (x',x;ti ) 



+ 1 s a (x',x;ti)- 



U a (x' ,ti)-U a {x,ti) 

D a (x f } x ; /*) 



v a {x', x; ti ) := ^2 <«') + + "a(<0 



(11.27a) 
. (11.27b) 



This section can be omitted during a first reading. 
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An optimal estimation of the parameters can be achieved (similar to the pro- 
cedure in Ref. [294], pp. 33ff) by minimizing the error function 




under variation of i i/f(*,-)> vl h (U), ^(<<), U a (x,U), U a {x,U), and 
D a (x' } x] t{). During this minimization procedure we must take into account 
constraints (11.8), the condition of positivity (7.32), and symmetry conditions 
(7.30b). However, it is possible to include these constraints directly in the for- 
mulation of the optimization problem: In order to guarantee the positivity of 
D a (x , ,x\t) we will use, like in Chapter 7, the representation 

(11.29a) 

(cf. (7.32)). The symmetry conditions (7.30b) can be obtained by the represen- 
tation 

(11.29b) 

Constraints (11.8b) can be replaced by introducing Lagrange multipliers 
A a (ti) and minimizing the function 

(11.30) 

instead of F. Here represenation (11.29) must be used in F. As solutions of the 
minimization problem we find a solution spectrum of optimal model para- 
meters in dependence of the still undetermined Lagrange multipliers \ a (U). 
Now we choose the Lagrange multipliers X a (U) just in such a way that the 
constraints (11.8b) are satisfied. The minimum of F* agrees with the minimum 
of F ) then (cf. (11.30)). 

The minimum of F 1 is characterized by a vanishing of the total differential 
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+ EEL 



SF 



+ EEE 

a x^x 1 i 



SF 



6U a (x,U ) 



+ EE 



A a (t,)e 

SF 



6G a (x,x',ti ) 

G a^x* ,x,t i)+G a (ar,a? / ;ti) 



6G a (x, x'\ti) 






SF 



+ EE 

a i 

+ EE 

a, b i 

+ EE 



^ 1 



a, 6 i 

SF 



i LK\(<*)J 

Except for the restrictions 



6v a (ti) 

bvliU) 
6v h,{U) • 



SU a (x s ,U) = - £ ttf„(*'.*«). 
SU a (x s ,ti) = - J2 



(11.31) 

(11.32a) 

(11.32b) 






given by the constraints (11.8a), all model parameters can be varied indepen- 
dently of each other. Inserting (11.32) into (11.31) and demanding SF' = 0 
leads to the minimum conditions 

SF 



$U a (x , t{) 
SF 

6U a (x, t() 
SF 

SG a (x, x * 5 
SF ± 
Sv a (ti) 

SF 

6v l(u) 

SF 

tiv'abiti) 



Haiti) — 0 


(* # *s) , 


(11.33a) 


- //“(<*) = 0 


(* # *s) , 


(11.33b) 


—j - A a(ti)e 


s.(*',*;<0+G.(*.*';‘i) 1 q 


(11.33c) 


0, 




(11.33d) 


0, 




(11.33e) 



= 0. 



(11.33f) 
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SF _! 

^ab (*i) 



with 



t*a{U) ■— 
^(ti) := 



SF 

SU a {x Sy U ) ’ 
6F 

6U a (x s ,ti) ' 



Using the abbreviations 



(11.33s) 



(11.34a) 

(11.34b) 



f%x', x; U ) := (<(x'|x; U) - e- S * (x '’ x;ti) g a (x', x; <«•)) (^0*, U)) 



(11.35a) 






the functional derivatives are given by 

6F 

t.A ~ 

x‘(^x) 



(11.35b) 



SU a {x,ti) J-J L 









-f a {x' ,x\ti)e 



(11.36a) 



xe 






U a (x',t,)-U a (T,t,) 



SG, 



xv a (x' , x; <,)e 
/f . . x = e _S “ (j: ' X,1 ‘ ) [/ a (x , ) x-,ti)g a (x', x;ti) 

a V* j > ti) L 

+/ a (*, *';<.)ir a (*, *';<») 



«5f 



<5^a(<«) 



t = ~5 3 •f a ( x, ’ x ; <i ) e 



-s a (*',ic;<0 



(11.36b) 



(11.36c) 

(11.36d) 



X^X 1 



SF 



\ ~ Y,f a ( x '> x ^ e 



(U36e) 






__ — 53/ a (x',x;t,)e 



-S a (x',X]ti) U a (x‘ < ti) — V‘(x,ti) 



foabfa) 



x^x' 

xPb(*',ti)> 



(11.36f) 
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rT/i \ “ * yn ,x,ii)e e 

tv 2 ab (ti) ^ 



x£x‘ 



Obviously we have 
6F 

8U a (x,ti ) 



X^X* 

I \ -5 o (x , ,x;t0 ra/ / . \ 

x^x 1 

= o ai. 



and analogously 



0 


(11.37a) 


o 

II 

w- 


(11.37b) 


lr-. SF 

A'a(fi) = s z2 SU a {x,ti) ~ ^ 


(11.38a) 


1 C T? 

** ^ ~ S^6U a (x,ti) ~° ' 


(11.38b) 



Hence conditions (11.33a) and (11.33b) can be rewritten in the form 

6F 

m^u) = “■ (1U9s) 

w^m) = °- (1139b) 



Next we must determine the Lagrange multipliers \ a (ti). By taking into 
account constraints (11.8b), from (11.33c) and (11.29b) we get the relations 



^a(U) — A a (ti ) n/q _ i\ ^ 



5 a (x / ,a;;ti) ! 1 






Inserting (11.40) into (11.33c), we find 



S(S -l)£,6G a (x, x';ti) 4 



(11.40) 



SG a (x,x';U) \ 5(5-1) SG a {x,x'-U ) 



S a (x' ,x;ti) 



(11.41) 
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and, as desired, 



1 Sa(»;fO 1 

TTiZ.' =1 



S(S-l) 



x^x 1 



if only conditions (11.33c) are fulfilled. 



(11.42) 



Another possibility to take into account constraints (11.8b) is to minimize the 
error function F with respect to v%(ti ), *&(*<). U a (x,ti), U a (x, ti), 

and D a (x', x\U) using representations (11.29). In contrast, v a (U) must be kept 
constant. Without loss of generality we can put v a {ti) = 1. Then, instead of 
(11.33c), we have the minimum condition 



6F 

6G a (x i x , - 1 t i ) 



i 

= 0 . 



(11.43) 



For the unsealed distances D a (x ' ,*;£*•) we obtain, from (11.43) and (11.36c), 
the useful relations 



Da{x 



^a(*E j ^t) 

dd (fl3^ ) X , t( ) 



(11.44a) 



(cf. (11.29a)) with 



and 



/ 


\ 2 / 




\ 2 


n a (x ,x;t { ) := [g a (x ,x;ti)P*(x,ti 


)) +(i 


f(x,x']ti)P*(x',t 


■•>) 



(11.44b) 



da{x j X] tj) 



:= <(x'|as; t i )g a (x' ) x; U) (P a e (a;, <,)) 

+ w*(x\x'\ ti)g a (x,x'; ti)^P*(x', £,•)) . 



(11.44c) 



If the error function F is minimal for the parameters Ua{XiU), U a (x,ti), 
D a (x',x;U), v a {ti) = 1, vl(ti), v l ab {U), and v^U), the same is, according 
to model approach (11.27), the case for the scaled parameters 



U e a (x,ti) := Ua{x,U)-Ca(ti), 



U“(x,ti) := U a (x,ti)-C a (ti), 



D e a(x',X-,t>) 






D a (x ,x',tj ) 
D a (ti) 

Vg (ti) 

D a {ti) ’ 



(11.45a) 

(11.45b) 

(11.45c) 



(11.45d) 
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v?(U) := 



D a (ti) 



ai[t) - D a (U) ’ 
D a (U) ' 



== 



(11.45e) 

(11.45f) 

(11.45s) 



In order to fix the scaled parameters in a unique way, we again demand that 
U£(x,ti), U£(x,ti), and D e a (x’ , x\ <,) satisfy the constraints (11.8). This leads 
to the relations 






C a (U) :=^U a (x,U) 



and 



D a 



(*») s(S — 1) ^ ,x;U) 



x^x 1 



(11.46a) 

(11.46b) 

(11.47) 



C a (ti) and C a (ti) have the meaning of mean utilities whereas D a (ti) repres- 
ents some kind of a unit of distance. 



Since an analytical solution of the minimum conditions (11.33), (11.36) is not 
possible, for the explicit determination of the model parameters from empi- 
rical data we must use numerical, computer-aided methods. Especially suited 
for this problem is the method of steepest descent [19,62,80]. The estima- 
ted parameters should be carefully checked for their plausibility because the 
minimization problem possibly has several (relative) minima. That is, the so- 
lution found can depend on the initial approximations for the parameters. 
Among other things, a reasonable solution must yield the non-negativity of 
the parameters v a (ti) and !/£($,•). 



11.3.2 Model Reduction 

The model for behavioural changes contains a lot of variables from which cer- 
tainly not all are significant for the description of the system. Thus a syste- 
matic simplification of the model (i.e. a model reduction) is desirable (cf. 
Ref. [190], pp. 461ff). For example, we can test the hypothesis Ho : "ab = 0 
that compromising processes yield no significant contribution to the considered 
behavioural changes. This is done by means of a statistical significance test 
(in linear cases e.g. with the F-test: cf. Ref. [190], pp. 663ff) which checks 
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whether certain parameters can explain an essential part of the variance of the 
quantities that are to be described. 



11.4 Migration in West Germany 



In the following we will illustrate the method for the determination of the model 
parameters from Section 11.3 by an example. It is not easy to find suitable data 
for that since data usually only exist for the temporal evolution of a certain 
behavioural distribution Pjftx,ti) but not for the corresponding transition rates 
w*(x f \x] t{). However, for some countries all necessary data are available for 
migration, i.e. for removals from one region to another (cf. Ref. [294]). In this 
case the behaviour x = x describes in which region x £ {1, . . .,5} out of S 
distinguished regions an individual lives. 



Here the results for migration in West Germany will be presented. West Ger- 
many was subdivided into ten federal states and West Berlin (cf. Fig. 11.1 and 
Tab. IV). 




Figure 11.1: Subdivision 
of West Germany into 
eleven federal states. 
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With respect to migration there can be assumed a more or less homogeneous 
population so that there is A = 1 and the subscripts/superscripts a, b can be 
omitted. Hence the corresponding migration model reads 



w(x'\x;ti) := 



i/ e (<,)e 



(/‘(x', ti )- [/'(*, ti) 



D*{x',x]U) 

+ (v u {U)P\x' ,U) + , 

(11.48) 



if the rather improbable compromising processes are neglected. 

The data for the 5=11 regions can be found in the Statistische Jahrbiicher 
which are edited in Wiesbaden (Germany) on an annual basis. In the follo- 
wing the n + 1 = 26 years to = 1960, = 1961 to 1 26 = 1985 will be exami- 

ned. Figure 11.2 shows the time- dependent population distribution P e (x , t t ) 
whereas Figure 11.3 depicts some examples for the temporal variation of the 
effective transition rates w e (x / \x;ti). 



Symbol 


Region x 


Name 


+ 


i 


S chles wig- Hols tein 


© 


2 


Hamburg 


A 


3 


Lower Saxony 


□ 


4 


Bremen 


O 


5 


North Rhine- Westphalia 




6 


Hesse 


N 


7 


Rhineland-Palatinate 


X 


8 


Baden-Wiirttemberg 


V 


9 


Bavaria 


\ 


10 


Saarland 


/ 


11 


Berlin 



Table IV: The eleven federal states of West Germany, their names, and the 
symbols that will be used for their graphical representation. 
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1 960 1965 197D 1975 i960 1965 

t, 

Figure 11.2: Time-dependent population distribution over the various federal 
states of West Germany. 




Figure 11.3: Temporal dependence of some effective transition rates of migra- 
tion in West Germany. 
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11.4.1 First Model Reduction 



Applying the method of steepest descent, the minimization of error function 
(11.28) provides the result 

v 2e (U) < v e (U) < v le (U) or v 2e (U) & 0 . (11.49) 



Thus avoidance processes are negligible from an empirical point of view though 
theoretically they could play a role (e.g. for individuals who move away from a 
town because they do not bear its population density any longer). Consequently, 
the migration model reduces to 



w(x'\x] ti) := 



1 



D e (x , 1 x]t i ) 



i/ e (U)e 



u'ix'jiyu'ixji) 



+v le (ti)P e (x\ti)e 



U'ix'jiyu^xji) 



(11.50) 



The optimal parameters for this model according to the method of steepest 
descent are illustrated in Figures 11.4 to 11.6. Figure 11.4 shows the rate i/ e (< t ) 
of ‘spontaneous removals’ as well as the rate v le (U) of ‘imitative remo- 
vals’ which are induced by personal contacts to the destination region. The 
utility functions U e {x ) t{) for spontaneous behavioural changes and the utility 
functions U e (x , ti) for imitative processes are depicted in Figures 11.5 and 11.6. 



11.4.2 Second Model Reduction 



The irregularity of the utility functions U e (x,ti ) indicates that these might 
reflect accidental fluctuations of the migration data. Thus we will examine the 
correlation coefficient 



^2 (w{x'\x-,ti) - w) (u> e (x'|x;<j) - W e ) 

* X^X 1 

(«’(*'!*;*<) - w ) «/]C - w e J 

y i x^x 1 y i x^x 1 



(11.51) 



between the empirical transition rates w e (x'\x;ti) and the model rates 
w(x f \x; ti) for various models w(x f |x; ti). Here we introduced the average rates 

v 7 v ' i x^x 1 



and 



(11.52b) 
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Figure 11.4: Rate v e (U) of spontaneous removals (□) and rate i/ le ($ t *) of imita- 
tive processes (A) according to the migration data for West Germany. 




Figure 11.5: Utility functions for spontaneous removals from one federal state 
of West Germany to another. 
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Figure 11.6: Utility functions for removals from one federal state of West Ger- 
many to another due to imitative processes. 




is also of particular interest. It describes how many percent of the empirical 
variance 

( , + ihs-i ) £IH^>- |I '-)‘ < n - 54 ) 

v ' v ' i x^x 1 

of the data are explained by a model w{x , \x\ti). 



Whereas model (11.50) has a correlation of r — 0.9997 and reconstructs 100.0% 
of the empirical variance, the contribution 




(11.55) 



describing spontaneous transitions reproduces 73.6% of the empirical variance 
with a correlation of r = 0.981. Here it is essential that, during the calculation 
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of (11.53) for contribution (11.55), we used the same model parameters as were 
determined for the total model (11.50). The contribution 



w(x'\x;ti) 



D*{x',x\ti) 



P e (x',ti)e 



U*(x‘ 



(11.56) 



representing imitative processes has a correlation of r = 0.694 and explains 
only 5.2% of the empirical variance. Hence contribution (11.56) is practically 
insignificant, and the migration model can be reduced to the form (11.55) 
which, interestingly enough, agrees with the migration model of Weidlich 
and Haag (cf. Ref. [294]). 

For the reduced model (11.55) once more a parameter optimization is carried 
out by means of the method of steepest descent. This yields the mobility rate 
i/ e (£,) depicted in Figure 11.7 and the utility functions U e (x,t{) illustrated in 
Figure 11.8. The effective distances D e (x ', x\ ti) can be calculated via formula 
(11.44). They are not only a measure for the average geographical distances 
but also for the transaction costs (e.g. removal costs) and for the differences 
of mentality, language, etc. which influence the removal behaviour. 




t. 

i 



Figure 11.7: Mobility rate for spontaneous removals in West Germany according 
to the migration model of Weidlich and Haag. 

A comparison with Figures 11.4 to 11.6 shows that the optimal parameter 
values of the reduced model (11.55) are different from those of the total model 
(11.50). For example, the utility functions U e (x,ti) for spontaneous transitions 
now reflect also tendencies which in the total model were delineated by the 
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utility functions U e (x , t{) for imitative processes (compare the curves for Berlin 

(/))• 







Figure 11.8: Utility functions of the single federal states of West Germany for 
the migration model of Weidlich and Haag. Apparent is the extreme depen- 
dence of the utility of West Berlin (/) on changes of the political situation. 
For example, a remarkable utility increase is observed after the erection of the 
Berlin wall in 1961. 



11.4.3 Comparison of the Weidiich-Haag Model and the 
Generalized Gravity Model 

The correlation of the Weidlich-Haag model (11.55) with the empirical mi- 
gration rates w^x'lx;^-) for the parameters depicted in Figures 11.7 and 11.8 
is r = 0.996. Hence this model reproduces the empirical data almost perfectly. 

Astonishingly enough, apart from the Weidlich-Haag model there exist other 
models which have the same number of parameters and show exactly the same 
correlation with the empirical data. This applies, for example, to the genera- 
lized gravity model 



w'(x'\ x;U) 



D e (x',x-,ti) 






(11.57) 
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(cf. (7.69)) because of the following reason: If the model parameters of the 
generalized gravity model are calculated via the formulas 



U e (x, U ) := U e (x, U) - In ^ P e (x,ti ) + -1 ^ In s/P e {x', *,•) 



(11.58a) 



and 

v u {U) 1 v e (tj) 

D e {x', x ; U) ‘ y/P*{x,U)P*(x' ,U) D e (x',x\ U) 



(11.58b) 



from the optimal model parameters for the Weidlich-Haag model, we have 



w\x f \x]ti) = w(x f \x;ti ) . 



(11.59) 



Whereas the Weidlich-Haag model bases on spontaneous removals, the gene- 
ralized gravity model supposes imitative processes. Obviously, apart from the 
correlation coefficient r we require another criterion in order to decide which 
of the two models describes migration in the most adequate way. For this pur- 
pose, we utilize the circumstance that the effective distances D e (x f , x \ £,) and 
D e (x\x;ti) respectively between two regions x and x f should be nearly inde- 
pendent of time t{. In particular, they should not depend on the population 
distribution P e (x,ti) which, however, is inevitably the case for one of both 
models due to relation (11.58b). Thus we will prefer that model for which the 
corresponding distances D ;;(a? x , a?;f,-) possess the smallest average variance 



n 2 



1 

(n + 1)5(5 - 1) 



EE 

i 



D (x',x]ti) 



1 

n + 1 



yi > x \ tj) 



(11.60) 

in the course of time. This criterion decides clearly in favour of the generalized 
gravity model. 



Figure 11.9 depicts the mobility rate v le (ti) for the generalized gravity mo- 
del (11.57). The corresponding utility functions U e (x,ti) are illustrated in Fi- 
gure 11.10. A comparison with Figure 11.8 shows that the utility functions 
U e (x,ti) of the generalized gravity model possess a considerably smaller va- 
riance than the utility functions t/ e (z,£i) of the Weidlich-Haag model. This 
can be interpreted in such a way that the utility functions U e (x,ti) of the 
Weidlich-Haag model still depend on the population distribution P e (x,U) 
(cf. Ref. [294], pp. 74ff) which seems not to be the case any more for the utility 
functions U e (x y ti) of the gravity model. 




U.(x.y 
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Figure 11.9: Mobility rate for removals in West Germany according to the 
generalized gravity model. 
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Figure 11.10: Utility functions of the single federal states of West Germany for 
the generalized gravity model. 
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11.4.4 Third Model Reduction 



The replacement of the time- dependent distances D e (x' ,x;ti) by time- 
independent effective distances 



£•(*'. x ) := X D *( x> ’ x; **) 



allows a further model reduction. The remaining model reads 



U* yX , X) 



Then the optimal value for the mobility rate i/ 1 *(£*•) is given by 



X — - — 

“ D e (x', x\ ti) 



U U (ti) = I/ le (i,-) — 



y L 

n . (r ' 



x-£x 



\ D t (x',x) 



since this relation best fulfils the demand 

v U (U) 1 v le (U) 



D*(x',x) D e (x',x;ti) 



(11.61a) 



(11.61b) 



(11.61c) 



(11.62) 



The improved mobility rate i/ 1 *(£,•) is depicted in Figure 11.11. Moreover, 
Table V lists the time-independent effective distances D*(x',x). 



For the description of the 5(5 — l)(n + 1) = 2860 effective transition rates 
w e (x* \x] ti) the final model (11.61b) requires only (5+ l)(n + 1) + 5(5 — l)/2 = 
367 parameters. That is, we have achieved a data reduction to 367/2860 = 
12.8%. For the description of one region the model needs, on average, only 
367/[5(n + 1)] = 1.28 parameters per year. Despite this, model (11.61) has a 
very good correlation of r = 0.98. 



11.5 Evaluation of Empirically Obtained Re- 
sults 

11.5.1 Sensitivity Analysis 

After determining the model parameters from empirical data, a sensitivity 
analysis of the resulting model with respect to small variations of the para- 
meters should be carried out [34,268,307]. This is necessary in order to realize 
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Figure 11.11: Improved mobility rate of removals in West Germany for the 
generalized gravity model with time-independent effective distances. 
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Table V: Time-independent effective distances D*(x',x) between the eleven 
federal states of West Germany according to the generalized gravity model. 
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whether a possible deviation of the estimated value of a parameter 

<M<t) G {^a(<«), I'fOi), V l ai {U), *&(*«)> U a (x, ti), U a (x, t{), D a (x' , X \ *,•)} 

(11.63) 

from its actual value would have an essential influence on the model behaviour. 



The way of proceeding in a sensitivity analysis is as follows: First, the temporal 
evolution of P a (a;,£) is calculated according to the BoLTZMANN-like equations 
(7.62) for the estimated model parameters = $*(£,). Then one separa- 
tely varies each model parameter by a small amount 5$* and computes the 
corresponding temporal evolution P^x^t). A suitable measure for the dissimi- 
larity of P a (x,t) and P^x^t) is provided by Theil’s inequality coefficient 
[272] 



(11.64) 



which lies between 0 and 1 and deviates the more from zero the more diffe- 
rent P a (x,t) and P'(aj,<) are. As the absolute sensitivity of the model with 
respect to a change of parameter we define the quantity 

(11.65) 

whereas with the term ‘relative sensitivity’ we mean 

( 11 . 66 ) 

where <5P a (a;,<) := P^(x,t) — P a (x,t) [307]. 






11.5.2 Decomposition of the Utility Functions with Re- 
spect to Explanatory Variables 

The utility functions usually split up into different contributions: 

L 

U a (x,U) = ^c l a (x)U[(x,ti) , (11.67a) 

f=i 
L 

U a (x,U ) = £c?(*)£7'(M,-). 
z=i 



(11.67b) 
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Here U l a (x,ti) represent different influencing factors (which are often inde- 
pendent of * and a). By a decomposition with regard to influencing factors we 
try to express the time- dependence of the utility functions U a (X)ti), U a {x,ti) 
by a smaller number of easily measurable quantities U l a (x y ti). The estimation of 
the constants c^x) and Cj*(a;) is carried out by means of a regression method 
(cf. Refs. [190], pp. 637ff, [81], pp. 415ff, and [82], pp. 494ff). It presupposes the 
knowledge of the utility functions and influencing factors for (n + 1) > ASL 
times 

Often the temporal evolutions of different influencing factors are correlated 
with each other. Thus one tries to express the utility functions U a {xj £), U a (x ) t) 
by uncorrelated and, in addition, interpretable explanatory variables 

Ul(x t t i ):='EM$(x)Ul(x,t i ). (11.68) 

3 

Then we have 

L 

U a (x,ti ) = 'Y^c l a (x)U l a (x,t i ) , (11.69a) 

i=i 

L 

U a (x,ti) = 52ct(x)U' a (x,ti) (11.69b) 

1=1 

with 

4(*) := , (11.70a) 

3 

c?(x) := £c?(*)(M‘)-/. (11.70b) 

3 

Suitable transformation matrices M_ a = can be determined by a factor 

analysis (cf. Ref. [149], pp. 401ff). 

A decomposition of the utility functions with respect to explanatory variables is 
also possible in one single step by means of the ranking regression analysis 
(cf. Refs. [238,294]). 



11.5.3 Prognoses 

A particularly interesting topic is the prognosis of behavioural changes [97, 
155]. The simplest prognosis technique would certainly be an extrapolation 
of the temporal evolution of P a (x , t). By doing this, however, the knowledge of 
the processes that take place in the system is not utilized. This can, especially 
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due to the non-linearity of the dynamics, easily lead to incorrect prognoses. 
A more reliable prognosis method bases on the extrapolation of the explanatory 
variables. P a (x,t) is then calculated in accordance with the BoLTZMANN-like 
equations (7.62). 



11.6 Examples for Decompositions of Utility 
Functions 

11.6.1 Purchase Pattern 

The decision on the purchase of a product x = i E {1,...,S} of a certain 
kind of good will decisively depend on the utility U a (i , t) of the product for the 
purchaser. From the point of view of sales strategy it is of special interest 
which factors determine the utility of a product. The empirical data that are 
necessary for this are collected in market analyses and surveys. In the follo- 
wing let us assume that the utility functions C/ a (i,f) and U a (i,t) have already 
been estimated from the data (cf. Sects. 11.2 and 11.3). 

First, it is reasonable to split off the influence of the number n?(f) of purchasers 
from the transition rates: 



— Ua(z,TI>,f) I — S a ( *, 0 + ^2 • 



b 



b,c 



(11.71) 



The terms 

- ]TV a & c n •(<)«,•(*) (11.72) 

b b,c 

describe the feedback effect of the purchasers’ decisions on the purchase 
decisions. The single contributions to (11.71) can be interpreted as follows: 



• K a b are agglomeration parameters. For K a b > 0 individuals of subpo- 
pulation a will tend towards purchasing a product i the more frequently 
the more individuals of subpopulation b decide in favour of it. So trend 
and fashion effects can be taken into account. (With regard to clothing, 
for example, for the customer one important aspect is to receive attention 
which represents an essential contribution to the utility.) 

• ( T a bc are saturation parameters. For <r a & c > 0 they reflect the avoidance 
of products which are widely spread and thus no longer individual or 
original. 
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• The inclination parameters 6 a (i,£) delineate the part of the utility of 
a product which is independent of the behaviour of other purchasers, i.e. 
they describe the degree of preference for or refusal of a product i by 
individuals of subpopulation a. 6 a (i,t) is determined by the purchaser 
interests and product characteristics. 

The objective of the following discussion is to decompose the inclination pa- 
rameters 6 a (i,t) into the various factors which are relevant for the purchase 
decision. For this purpose, let us consider L different characteristics / of the 
products z, e.g. price, standard/service, quantity, perishability /durability, co- 
lour, and shape. Moreover, let d l a (i,t) with 

(11.73) 

be a measure for the difference between the ‘ ideal 9 and real characteristic 
/ E {1,...,L} of product i E {1,...,5} for individuals of subpopulation a. 
The quantity d l a (i,t) can be collected by means of a survey using a rating 
scale similar to the semantic differential (cf. Sect. 6.2.6) and will further be 
denoted as dissatisfaction with respect to characteristic /. We expect that the 
inclination towards a product i decreases with growing dissatisfaction d l a (i,t). 
This effect will be the stronger the larger the relevance r l a of characteristic / 
for individuals of subpopulation a. Hence a plausible approach is 

L 

6 a (i,t) := ir 0 (i) “ 5^ r i4( l '.0 - C a (t) - (11.74) 

J=i 

Here C a (t) is a scaling parameter (cf. (11.8a)) and 7r a (z) with 

(11.75) 



denotes the prestige which a product i has among subpopulation a indepen- 
dently of the L rated characteristics /. 

In summary, the model for the utility functions E/ a (i,t) has the form 

L 

Ua(i,t) := 7T a (i) + ^K<x»^(i) - ^ <r a4e nj(<)n?(f) - ^r^(i,t) - C a (t) . 

b b } c 1=1 

(11.76a) 

For the utility functions U a (i,t ) that are related to pair interactions we choose 
an analogous approach: 

L 

U a (i,t) := *“(i) + - £<r a6o «.-(<K(*) - Erfd'tM) - c a (t) 

b b,c 1=1 



J3»«(*)-o 



o <<**«(*, t) < 1 



(11.76b) 
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with 



53 x °(* 


) = o 


i 





(11.77) 



By distinguishing (11.76a) and (11.76b), we take into account that a characte- 
ristic / possibly has another relevance at pair interactions (i.e. conversations) 
than at spontaneous opinion changes ( r l a (t ) ^ r ? a (£)). 



For the determination of the coefficients w a (i), /c a &, < 7 a $ c , r l a , 7r a (i), /c a6 , a abc , 
and rf we must carry out a regression analysis. A certain characteristic / 
will only yield an essential contribution to the utilities [/ a (z,t) or U a (i,t) of a 
product i if 



1. the dissatisfaction with respect to / is not negligible (i.e. there is not 
d l a (i,t) < 1), 

2. the relevances r ] a and rf of the characteristic l are large, 

3. the relevances r l a and rf are significantly different from zero (i.e. the 
variations of the terms r l a d l a (i,t ) and rfd l a (i,t) can explain an essential 
part of the variations of U a (i,t) and U a (i,t)). 

As a consequence of the first criterion, from the economist’s point of view it 
is not very reasonable to improve product characteristics which are already 
close to the purchasers’ wishes (unless the expectations of the purchasers can 
be raised by clever advertising). 

In order to take into account also those individuals who do not decide on the 
purchase of one of the products i E {1, . . . , 5}, an additional state i = 0 can 
be introduced. For the corresponding utility functions we choose 



(11.78a) 



(11.78b) 



since, this time, it is not the product characteristics which play a role but the 
feedback effects. 



U a ( 0, t) := 7T a (0) + ^2 K <»» n o(*) - 53 <7 'a»c n o(0 n o(0 - Ca(t) 

b b,e 



U a (0, t) := n a (0) + £ K ab n b 0 (t) - £ <7 a4 ‘n‘(<K(<) - C a (t) 

b b,c 



11.6.2 Voting Behaviour 

In the case of a voter’s decision (electoral decision) among several parties the 
states i E {1, . . . , 5} denote just the S parties which can be elected. By i = 0 
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we take into account the possibility to vote for none oi the parties. Analogously 
to Section 11.6.1, from the utility functions we first split off the dependence on 
the numbers nf (t) of voters (the feedback effects or influences of indirect 
interactions): 



(11.79a) 




• This time the agglomeration parameters K a b reflect that a party i is 
the more preferred the more individuals vote for it. On the one hand, the 
reason for this might be that people believe that a big party is capable 
of exerting more power to make its political goals come true. On the 
other hand, individuals orientate themselves by the behaviour of their 
social environment especially if it is a widely disseminated behaviour 
(bandwaggon effect). It is particularly the so-called people’s parties 
with a high proportion of loyal voters which profit from this fact. 

• The saturation parameters <r a & c describe a reduction of the bandwag- 
gon effect in cases where a party already has a huge number nf of follo- 
wers. This reduction results from the attempt to avoid the predominance 
of one single party i. (A well working democracy requires a balance of 
power, in particular, an opposition that is not too weak.) 

• The inclination parameters 6 a (z,£) delineate the degree of preference 
for or refusal of party i by subpopulation a which is independent of the 
number of its followers. 6 a (z, t) is determined by the voters’ interests 
and the goals of the parties (party programmes). 

This time the inclination parameters will be decomposed in the sense of a 
competence model: 



(11.79b) 



According to (11.79b), a party i E {1, . . .,5} will be the more preferred with 
respect to an affair / E {1, . . . , L] the higher its competence k l a (i,t) and the 
stronger the need for action 



(11.80) 



in this affair from the standpoint of subpopulation a. 




S a (i, t) := ir a (i) + ^ c^D^k^i, t) - C a (t) 

i 



• With 



0 < k l a (i,t) < 1 



(11.81) 
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the competence k l a (i } t) is demanded to lie in the interval [0, 1]. It can be 
determined by surveys using suitable rating scales. 

• c l a represents a measure for the importance of affair l for individuals of 
subpopulation a. 

• D l a (t) with 

(11.82) 

is a measure for the discrepancy between the ‘ideal* and real state of 
society from the viewpoint of subpopulation a with regard to affair /. 
Relevant affairs l are, for example, the topics unemployment, purchasing 
power, standard of living, housing market, leisure time, ecology, abortion, 
or armament. 

• Ka{i) with 

(11.83) 



reflects the prestige of a party i within subpopulation a. 

• C a (t) denotes a scaling parameter (cf. (11.8a)). 

(11.79) implies that a party i can, with respect to affair /, gain political attrac- 
tiveness in various ways: 

• The party can increase its competence k l a (i,t ) with respect to affair /. 

• It can profit from an increase of the discrepancy D l a (t) between the ‘ideal’ 
and real state of society which is possibly caused by a deterioration of the 
social situation. However, a growth of discrepancy D l a (t) is also achie- 
vable by raising the expectations about the ‘ideal’ state of the society. (If 
the ‘ideal’ state with respect to a certain affair l is reached ( D l a (t ) = 0) 
there will be no longer a need for action H l a (t). Thus, with respect to this 
affair there will exist no reason to prefer party i to another party i' .) 

• The party can also try to gain public interest in affairs / in which it is 
especially competent, i.e. to increase the importance c l a of these affairs. 
In addition, it can try to play down or to hush up affairs where it is 
incompetent. 

The approach and interpretation concerning the utility functions U a (i,t) for 
pair interactions is completely analogous: 

U a (i,t) := 6 a (i,t) + - 5> a » e nJ(*K(<) I, 

b b,C 



X* «(*) -° 



o < D l a (t) < 1 



(11.84a) 
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(11.84b) 



(11.84c) 

By distinguishing (11.79) and (11.84) we take into account that it is possible 
for an affair / to have another importance at pair interactions (i.e. discussions) 
than at spontaneous opinion changes ( c l a (t ) ^ cf(t)). 

Finally, like in (11.78), we assume for the utility to vote for none of the parties 
U a (0,t) := M°) + ][^Ka»no(<)-]Ptr a6c n£(<) n ;i(0-Ca(^ (11.85a) 

b b t c 

U a (0,t) := + (H-85b) 

b b,c 

An alternative model for voting behaviour would, analogously to Section 11.6.1, 
be provided by approach (11.76) which expresses the preference for a certain 
party i by the dissatisfaction d l a (i y t) of the voters of subpopulation a with 
the programmes of the parties i with respect to different affairs /. A comparison 
shows that model (11.79), (11.84) can be transformed into model (11.76) by 
means of the identifications 

(11.86a) 

(11.86b) 
(11.86c) 

C a (t) := C a (t)-^2c l a D l a (t), (11.86d) 

l 

C a (t ) := C a (t)-')TctD , a (t). (11.86e) 

/ 

Accordingly, the relevances r l a (t) and rf(t) of an affair l are given by the needs 
for action c l a (t)D l a (t) and cf(t)D l a (t) respectively. Moreover, the competence 
k l a (i,t) — 1 — dj,(i,<) of party i with respect to affair / from the viewpoint of 
subpopulation a decreases with growing dissatisfaction d l a {i,t), i.e. with gro- 
wing distance between the party programme and the political expectations of 
subpopulation a. 
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Obviously, the identifications (11.86) are quite plausible so that the models 
(11.76) and (11.79), (11.84) are compatible. However, in the case of sufficient 
data, the competence model (11.79), (11.84) should be prefered because, with 
(11.86a), (11.86b), it additionally describes the time-dependence of the rele- 
vances r l a (t), rf(t). 

The decomposition of the utility functions f/ a (i, t) and U a (i, t) is especially inte- 
resting with regard to prognoses. However, the prognosis of election results 
is connected with some complications [52]: 

• The voters’ decisions depend on the expected election results, especially 
on predictable coalitions. 

• Since the voters make a decision not only for a short time but for a 
whole parliamentary term, they include in their decision procedure 
the expectations about the future change of the discrepancies D l a (t) 
and of the dissatisfaction d l a (i,t) with respect to the different affairs /. 

• Of great importance are also political events of the day which are 
hardly predictable. 

In the case of purchase decisions comparable complications are normally un- 
important. Consequently, for these better results can be expected. 



11.6.3 Gaps in the Market and Foundations of New Par- 
ties 



On the one hand, after the introduction of a new product or the foundation of 
a new party i := 5 + 1 at the time to, new prognoses for the most probable 
purchase pattern or voting behaviour might become necessary. On the other 
hand, however, our models from Sections 11.6.1 and 11.6.2 allow the estimation 
of the achievable market share of a new product or the attainable number 
of foHowers of a new party. With this aim in view, we calculate the solution 
P a (S 4- l,f) of BOLTZMANN-like equations (7.62) on the basis of the predicted 
temporal evolution of the utility functions U a {i, t) and U a {i , t). In order to take 
into account the non- eligibility of the product or the party 5+1 before the 
time of appearance, we formally set 



U a (S + l,f) := -oo 



and 



U a (S+ 1,0 :=-oo 



for t < to . (11.87) 



For reasons of uniqueness and continuity of the utility functions we choose the 
constraints 





for t < to 



(11.88a) 




